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Yes: 


Claims 


1-14, 16-27 




No: 


Claims 


15 
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2. Citations and explanations 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 



VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether th 
claims are fully supported by the description, are made: 

see separate sheet 
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Part V 

a) Relevant documents 

D1 (Science, 265, 1994, 2008-2010): describes the GMS technique (Genomic 
mismatch scanning). 

GMS: after restriction fragmentation of two genomes, the first sample is methylated. 
The samples are melted to produce single-stranded DNA fragments, mixed together 
and reannealed. Mismatched heteroduplexes as well as fully-methylated and fully- 
unmethylated homoduplexes are discarded in order to select regions of the genome 
that are identical-by-descent (p. 2009, col. 2 and 3). 

D2 (BioTechniques, 15(2), 1993, 304-309): Technique to isolate sequences of 
genomic DNA flanking dinucleotide repeat (VNTR) utilizing poly(AC/GT) oligonucleotide 
primers (p. 304, 3. col.) and PCR. A single non-motif nucleotide is added to the 3' end 
of the primer so that it can only perfectly base-pair at the junction between repeat and 
unique DNA sequences (p. 305, 3. col. and Table 1). The method includes the following 
steps: 

- fragmenting genomic DNA by restriction endonucleases 

- ligating fragments into a vector to provide PCR priming sites (adaptors) on each end 
of the fragment 

- amplifying DNA sequences flanking VNTR by a poly(AC) primer and a primer 
complementary to a vector sequence (the adaptor) 

Fig. 1 shows DNA sequences consisting of 

- a single VNTR allele and its flanking regions and an adaptor at the 3'-end and 5'-end 

- 3'-flanking regions of a chosen VNTR sequence carrying an adaptor at the 3'-end 

- 5'-flanking regions of a chosen VNTR sequence carrying an adaptor at the 5'-end. 

D3 (WO 96/17082): A process for detecting polymorphisms between two samples of 
nucleic acid comprising separately treating each nucleic acid sample according to the 
following steps (see claim 12): 

a) digesting the nucleic acid with at least one restriction enzyme 

b) ligating adaptor segments to the ends of the restriction fragments of step a 
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c) amplifying the fragments of step b using a first primer complementary to a sequence 
repeat (VNTR) and a second primer complementary to an adaptor segment. 

Fig. 1a shows the creation of a mixture of 5'-flanking VNTR amplimers and of a mixture 
of 3'-flanking VNTR amplimers. 

D4 (Nature Genetics, 4(1), 1993, 11-17): Genomic mismatch scanning for pairs of 
individuals. Regions of the genome that are identical-by-descent are isolated, based on 
the unique ability of DNA from these regions to form large "heterohybrid" DNA duplexes 
that are free of mismatches. Strands of the mixture are separated and reannealed and 
mismatched duplexes are discarded by the use of nicking enzymes (p. 11-12). 

The same concept is described by D5 (Nature Genetics, 4(1), 1993, 5-6). 

b) Claims 1-10 and 21 

Independent claims 1 and 21 are distinguished from the most relevant prior art, which is 
represented by either of D2 or D3, by relating to adaptor-terminated fragments in which 
each 3'-end is blocked to prevent enzymatic chain extension. Novelty is therefore 
acknowledged (Art. 33(2) PCT). The distinguishing feature ensures that PCR cannot 
start at the adaptor but only at the VNTR sequence (see also figure on page 19 of the 
application) and therefore avoids amplification of adaptor-terminated fragments which 
do not contain the VNTR sequence. The methods according to claims 1 and 21 
therefore escape the non-specific amplification of background that is common to the 
prior art techniques. 

Furthermore, the blocking of the 3'-ends prevents interfragment ligation. 

As none of the available prior art documents gives any indication as to the use of 3'-end 
blocked fragments, an inventive step can be acknowledged for claims 1 and 21 (Art. 
33(3) PCT). 

Claims 2-10 contain preferred embodiments of the novel and inventive idea embodied 
in claim 1 and would therefore also appear to meet the requirements of Articles 
33(2)and 33(3) PCT. 
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c) Claims 11-15 

Claim 1 1 appears to be novel (Art. 33(2) PCT). D2 (see Fig. 1) and D3 (see Fig. 1a) 
which both represent the most relevant state of the art disclose restriction enzyme 
digestion of human genomic DNA, resulting in fragments. Only a part of these 
fragments contain the chosen VNTR, i.e. this mixture does not consist of fragments 
including the chosen VNTR. Selective amplification of fragments containing the chosen 
VNTRs produces sequences including a part of the VNTR and the first flanking regions 
and sequences including a part of the VNTR and the second flanking region. However, 
D2 and D3 do not disclose a mixture consisting of fragments which include the chosen 
VNTR and the flanking regions on both sides. 

Such a mixture is obtained according to the present invention by further method steps 
(see description pages 24-27). As this special mixture has several advantages (it can 
be used for mismatch discrimination) and was not suggested by the prior art, presence 
of an inventive step is acknowledged for claim 1 1 (Art. 33(3) PCT). 

Analogous arguments apply to dependent claims 12 and 13 and to independent claim 
14 which also meet the requirements of Art. 33(2) and (3) PCT. 

Claim 15 does not appear to be novel (Art. 33(2) PCT). D2 and D3 both disclose a 
mixture of 3"-flanking regions of a chosen VNTR sequence carrying an adaptor at the 
3'-end and a mixture of 5'-flanking regions of a chosen VNTR sequence carrying an 
adaptor at the 5'-end. 

d) Claims 16-20, 22-24 and 25-27 

Independent claims 16 and 22 relate to mismatch analysis of polymorphic alleles which 
are representative of those which manifest a trait of interest respectively of those alleles 
which do not manifest the trait. The method is clearly distinguished from the GMS 
(genomic mismatch scanning) technique disclosed in D1, D4 and D5. GMS is applied to 
pairs of individuals to identify genome regions that are identical-by-descent. Two 
individuals, however, do not form a representative sample for a special phenotype. 
Furthermore, the technique according to claims 16 and 22 uses essentially the 
polymorphic alleles, whereas GMS exploits the entire genome rather than just the 
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polymorphic sequences. 

As none of the prior art documents discloses or suggests the method according to 
claims 16 and 22, novelty and an inventive step are acknowledged (Art. 33(2) and (3) 
PCT). 

Claims 17-20 and 23-24 contain preferred embodiments of the novel and inventive idea 
embodied in claims 16 and 22 and would also appear to meet the requirements of 
Articles 33(2) and (3) PCT. 

Analogous arguments apply to claims 25-27 which refer to novel and inventive claims 
and therefore meet themselves the requirements of Art. 33(2) and (3) PCT. 



Part VII 

Contrary to the requirements of Rule 5.1(a)(ii) PCT, the relevant background art 
disclosed in the documents D2 and D3 is not mentioned in the description nor are these 
documents identified therein. 



Part VIII 

a) The feature of claim 26, that the VNTR alleles are immobilised to an array, is not 
referred to in the description. Claim 26 is therefore not supported by the 
description as required by Article 6 PCT. 

b) It appears that claim 1 contains all the technical features of claim 21 and should 
therefore have been made dependent on this claim (Art. 6 and Rule 6.4 PCT). 

c) As an individual apparently has at the most two alleles of a chosen VNTR 
sequence, his genomic DNA does not include a representative mixture of alleles. 
Claim 1 1 should therefore have been amended to relate to several members of a 
species of interest by deleting the reference to one member (Art. 6 PCT). 
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d) The expression "flanking region" used throughout the claims is vague and the 
skilled person does not know which sequence lengths are encompassed by the 
term. Therefore, this expression renders the claims unclear (Art. 6 PCT). 
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amplimers and the mixture of 3'-flanking VNTR amplimers as primers to 
make the desired mixture of VNTR alleles and their flanking regions. 

The species of interest may be any eukaryotic species from 
the plant and animal kingdoms. Although they do not show repetitive 

5 sequences in quite the same way, prokaryotic species are also envisaged. 
An individual member of a species may be for example a plant or a micro- 
organism or an animal such as a mammal. 

In another aspect the invention provides a portion of genomic 
DNA of one or more members of a species of interest, said portion 

10 consisting essentially of a representative mixture of alleles of a chosen 
VNTR sequence and their flanking regions. 

The term "representative mixture of alleles" does not 
necessarily imply that all of the possible alleles, or even most of these 
possible alleles, of a chosen VNTR sequence are present. Whether a 

is particular allele is present or not, e.g. in the mixture generated by the 

method defined above, may depend on the nature of a restriction enzyme 
used in step a) and on other factors. 

The invention also provides a portion of genomic DNA of a 
species of interest, said portion consisting essentially of a representative 

20 mixture of 3'-flanking regions of a chosen VNTR sequence, each member 
of the mixture carrying an adapter at its 3'-end, and a representative 
mixture of S'-flanking regions of a chosen VNTR sequence, each member 
of the mixture carrying an adapter at its 5'-end. 

The invention also provides a method of treating nucleic acids 

25 which consist essentially of a mixture of polymorphic alleles, e.g. of a 
chosen VNTR sequence and their flanking regions, or alternatively a 
mixture generated in some other way such as AFLP, microsatellite-AFLP, 
GMS or RAPD, the mixture being representative of those which manifest a 
trait of interest, which method comprises separating and then re-annealing 

30 strands of the mixture, and 
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4. The method of any one of claims 1 to 3, wherein in step c) 
and/or d) the adaptor or primer used contains at least one 
phosphorothioate bond. 

5. The method of any one of claims 1 to 4, wherein step e) is 
performed using as primers, either successively or together, both the 
mixture of 5 - flanking VNTR amplimers and the mixture of 3 - flanking 
VNTR amplimers. 

6. The method of any one of claims 1 to 5, wherein there is used 
in step e) genomic DNA of one or more members of the species of interest 
which manifest a trait of interest, whereby the resulting mixture of VNTR 
alleles and their flanking sequences is representative of those which 
manifest the trait of interest. 

7. The method of claim 6 wherein in a step f) the strands of the 
mixture of VNTR alleles and their flanking regions are separated and then 
re-annealed and any mis-matches are separated and discarded. 

8. The method of claim 7, wherein step f) is repeated to recover 
a single VNTR allele and its flanking regions. 

9. The method of any one of claims 6 to 8, wherein at least one 
VNTR allele and its flanking sequences representative of those which 
manifest the trait of interest, is hybridised with a mixture of VNTR alleles 
and their flanking sequences representative of those which do not manifest 
the trait of interest, and at least one match and/or at least one mis-match is 
selected to provide at least one VNTR allele or fragment thereof which is 
characteristic of the trait of interest. 

10. The method of claim 9, wherein the at least one VNTR allele 
and its flanking sequences representative of those which manifest the trait 
of interest, is provided with 3'- overlapping ends. 

11. A portion of genomic DNA of one or more members of a 
species of interest, said portion consisting essentially of a representative 
mixture of alleles of a chosen VNTR sequence and their flanking regions on 
both sides. 
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12. The portion as claimed in claim 1 1 , wherein the mixture of 
alleles is representative of those which manifest a trait of interest. 

13. The portion as claimed in claim 11 or claim 12, wherein each 
member of the mixture has an adaptor at each of its 3'-end and its 5'-end. 

5 14. A portion of genomic DNA of one or more members of a 

species of interest, said portion consisting essentially of a single VNTR 
allele and its flanking regions and an adaptor at each of its 3-end and its 
5'-end, said allele being characteristic of those which manifest a trait of 
interest. 

10 15. A portion of genomic DNA of a species of interest, said 

portion consisting essentially of a representative mixture of 3-flanking 
regions of a chosen VNTR sequence, each member of the mixture carrying 
an adaptor at its 3'-end, and a representative mixture of 5'-flanking regions 
of a chosen VNTR sequence, each member of the mixture carrying an 

is ad aptor at its 5-end . 

16. A method of treating nucleic acids which consist essentially of 
a mixture of polymorphic alleles, the mixture being representative of those 
which manifest a trait of interest, which method comprises separating and 
then re-annealing strands of the mixture, and separating and discarding 

20 any mis-matches. 

17. The method of claim 16, wherein the mixture of polymorphic 
alleles is a mixture of alleles of a chosen VNTR sequence and their flanking 
regions. 

18. The method of claim 17, wherein the method is repeated to 
25 recover a single VNTR allele and its flanking regions. 

19. The method of any one of claims 16 to 18, wherein at least 
one VNTR allele and its flanking sequence representative of those which 
manifest the trait of interest, is hybridised with a mixture of VNTR alleles 
and their flanking sequences representative of those which do not manifest 

30 the trait of interest, and at least one match and/or at least one mis-match is 
selected to provide at least one VNTR allele or fragment thereof which is 
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characteristic of the trait of interest. 

20. The method of claim 19, wherein the at least one VNTR allele 

and its flanking sequence representative of those which manifest the trait of 
interest, is provided with 3'-overlapping ends. 
5 21. A method of making a mixture of amplimers which method 

comprises the steps of: 

a) dividing genomic DNA of one or more members of a species 
of interest into fragments, 

b) ligating to each end of each fragment an adaptor thereby 

10 forming a mixture of adaptor-terminated fragments in which each 3'-end is 

L 

blocked to prevent enzymatic chain extension, and 

c) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR primer to 
create a mixture of S'-flanking VNTR amplimers, and/or 

15 d) using a portion of the mixture of adaptor-terminated 

fragments as templates with an adaptor primer and a VNTR antisense 
primer to create a mixture of 3'-flanking VNTR amplimers. 

22. A method of identifying an allele which is linked to a trait of 
interest, which method comprises incubating together under hybridisation 

20 conditions: at least one molecule of nucleic acid containing a polymorphic 
allele and its flanking sequences representative of those which manifest the 
trait of interest; and a mixture of molecules of nucleic acid which contain 
polymorphic alleles and their flanking sequences representative of those 
which do not manifest the trait of interest; and selecting at least one match 

25 and/or at least one mis-match to provide at least one allele or fragment 
thereof which is linked to the trait of interest. 

23. The method of claim 22, wherein the alleles are VNTR alleles. 

24. The method of claim 22 or claim 23, wherein the at least one 
allele and its flanking sequences representative of those which manifest the 

30 trait of interest, is provided with 3 - overlapping ends. 
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25. Use of the portion of genomic DNA as claimed in claim 14 in 
a diagnostic assay. 

26. The method of any one of claims 1 to 10 or 16 to 20, wherein 
the VNTR allele and its flanking regions, or the mixture of VNTR alleles and 
their flanking regions, is analysed by being applied under hybridisation 
conditions to an array of immobilised VNTR alleles and/or their flanking 
regions. 

27. A kit comprising protocols and reagents for performing the 
method of any one of claims 1 to 1 0, 1 6 to 24 or 26. 
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EXTRACTION AND UTILISATION OF VNTR ALLELES 



Glossary of Terms and Abbreviations 



adapter 



10 



15 



25 



AFLP 
allele 

amplimer 



DNA 

DNA fingerprint 

20 GMS 
individual 
heteroduplex 



heterozygygous 



homozygous 

30 locus 

mis-match 



nucleotide sequences, usually comprising annealed 

complementary oligonucleotides, ligated to DNA 

fragments that allow specific amplification and 

manipulation of those fragments 

amplified fragment length polymorphism 

one of several possible alternative sequence variations 

at any one locus 

the product, or pool of products, generated by 
amplification with the adapter primer and an internal 
primer 1 

deoxyribonucleic acid 

the display of a set of DNA fragments from a specific 
DNA sample 

genomic mis-match scanning 

a member of any species subject to investigation 

a duplex of two alleles derived from different individuals, 

sets of individuals or populations 

alleles at the same locus of each of the paired 

chromosomes in a diploid cell being different 

homoduplex a duplex of alleles derived from the same 

individual, set of individuals or population 

alleles at the same locus of the paired chromosomes of 

a diploid cell being identical 

a specific position on a chromosome 

one or more bases in a duplex that fail to form stable 
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10 



NASBA 

PCR 

RAPD 

RDA 

RFLP 

trait 

TRAIT 

VNTR 



15 



hydrogen bonds with opposing bases 

nucleic acid sequence based amplification 

polymerase chain reaction 

random-amplified DNA markers 

representational difference analysis 

restriction fragment length polymorphism 

a distinguishing feature or characteristic manifesting 

itself physically, chemically or biologically 

Total Representation of Alleles that are Informative for a 

Irait 

variable number tandem repeat, also referred to as 
simple sequence repeats (encompassing all repeats of 
two or more nucleotides that may be continuous or 
interrupted by short non-repetitive sequence, including 
minisatellites and microsateilites). 



20 



25 



30 



Field of the invention 

The field of this invention is the detection of polymorphic 
variation in complex genomes, which is the mainstay of the study of 
hereditary traits in all organisms. Since polygenic traits far outweigh those 
that are monogenic, a procedure that allows the isolation in concert of 
several informative polymorphisms within the complex genomes of multiple 
individuals would provide an extremely powerful tool for the investigation of 
hereditary traits. 

The invention differs fundamentally from all other techniques 
that have been previously employed by: 

(i) permitting mass generation of VNTRs quickly and easily from 
DNA 

(ii) generating; polymorphisms that are both linked and 
informative for a trait; 

(iii) reproducing and preserving the polymorphic allele, as it 



i. 
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occurs in the genome; 

(iv) negating problems that are features of other polymerase 

chain reaction based techniques; including miss priming, reaction 
contamination and generation of spurious products; 
5 (v) negating the need for investigations to be confined to families 

of closely-related individuals; 

(vi) permitting the analysis of polygenic traits; 

(vii) having a sparing requirement for DNA starting material. 
The invention therefore represents a major advancement in 

10 the ability of workers in the biomedical fields to screen simple or complex 
genomes, rapidly and with fidelity, for polymorphisms co-segregating with 
advantageous or deleterious monogenic or polygenic hereditary traits. 
There is enormous potential for advancement of medicine, veterinary 
medicine, forensic science, agriculture, animal husbandry and 

15 biotechnology, by the generation of polymorphic markers co-segregating 
with hereditary disease or traits of social or economic importance. The 
invention will also serve to facilitate mutation analysis for all relevant 
organisms. 

20 Introduction 

DNA is a double stranded linear polymer composed of 
repetitions of four mononucleotide units. The sequence in which these 
units are arranged gives rise to a genetic code, referred to as the genome. 
Although the genomes of all individuals within a species are essentially 

25 homologous, subtle variations exist which impart individuality. Locations of 
the genome at which more than one sequence variation may exist are 
termed polymorphisms, each variant of that sequence representing an 
allele. Polymorphisms in gamete-forming germinal cells will be inherited by 
subsequent generations of progeny. By studying the combination of 

30 polymorphisms in the genome of an individual a unique code ('fingerprint') 
can be assigned and the ajncestry of that individual can be determined. 
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Furthermore, a polymorphism found to be linked and co-segregating with a 
particular genetic trait or hereditary disease may be used as a marker for 
genetic screening of that trait or disease in other individuals. 

The study of advantageous or deleterious hereditary traits in 

5 complex genomes has been the subject of considerable interest due to its 
economical, medical and social implications. The establishment of 
protocols that allow the comparison of nucleic acid sequences in complex 
genomes and the isolation of differences unique to a subset of those 
sequences is a fundamental requirement of this field of study. 

io A number of protocols have been used in animals and plants 

for the comparison of nucleic acid sequences and isolation of differences 
between those sequences in individuals. These protocols involve 
restriction fragment length polymorphism (RFLP), random-amplified 
polymorphic DNA markers (RAPD), amplified fragment length 

is polymorphism (AFLP), representational difference analysis (RDA) f genomic 
mis-match scanning (GMS), and linkage analysis of variable number 
tandem repeats (VNTR). These protocols detect polymorphisms by 
assaying subsets of the total DNA sequence variation in a genome. 
Polymorphisms detected by RFLP, AFLP, and RDA rely on the generation 

20 of a fingerprint ladder by gel-electrophoresis which reflects restriction 
fragment size variation. RAPD polymorphisms result from sequence 
variation at primer binding sites and differences in length between primer 
binding sites. GMS polymorphisms result from sequence variation within 
heterohybrid molecules comprising restriction fragments derived from two 

25 related individuals. Linkage analysis involves the detection of length 

variation of variable number tandem repeats (VNTRs) and co-segregation 
of one allele with a trait of interest 

RFLP 

30 RFLP analysis relies on the cleavage of a nucleic acid 

sequence by restriction endonucleases and separation of the resulting 
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fragments by gel electrophoresis. The fragments are blotted onto a 
membrane and hybridized to labelled probes to allow detection of fragment 
length variation. This technique may be of use in the study of a single 
isolated locus or gene fragment, but where an investigation is not confined 
5 to an isolated sequence it is inadequate. Further limitations are that only a 
small number of the polymorphisms generated may be informative, there is 
a high demand for DNA starting material, and the method is labour 
intensive. 

10 RAPD 

RAPD is a commonly used PCR-based polymorphic marker 
technique in genomic fingerprinting and diversity studies, particularly for 
plant species. This technique involves the use of a single 'arbitrary primer 1 
which gives rise to amplification of regions of genome where there is 

15 sufficient homology between the sequences of genomic DNA, in the 5' to 3* 
direction, and that of the arbitrary primer. The amplified products are 
separated by gel electrophoresis. Subtle variations of this method include 
arbitrary primed-PCR (AP-PCR) and DNA amplification fingerprinting 
(DAF). However, the principle of arbitrary priming and amplification of DNA 

20 by PGR for difference analysis is common to all. Advantages compared to 
RFLP are that these methods are more rapid, have a lower demand for 
DNA, and do not require prior knowledge of sequence. A limitation in 
common with RFLP is that each analysis can only compare the genomes of 
two individuals. Although several loci can be evaluated concomitantly by 

25 this method, detection of polymorphisms requires observation of variation 
in band patterns by gel-electrophoresis and is subject to errors of 
superimposition of different alleles of similar electrophoretic mobility. Many 
bands may be faint and difficult to interpret, and it is difficult to achieve 
consistent results in repeat experiments. In common with the majority of 

30 PCR techniques, the results are prone to error by subtle changes in 
reaction conditions, reagent contamination, and the generation of 
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inconsistent banding patterns. This lack of reliability limits the usefulness 
of such techniques in the 'typing 1 of individuals. 

AFLP 

5 AFLP analysis (EP, A, 0534858; Zabeau M et a/.) involves 

restriction endonuclease digestion of DNA and ligation of the generated 
restriction fragments to adapters. Using primers complementary to the 
adapter sequence, the restriction fragments are amplified by PCR t and the 
products are separated by gel-electrophoresis t differences in band patterns 

10 revealing polymorphisms. Microsatellite-AFLP (WO 96/22388; Kuiper M et 
a/.) is a modification of this technique in which two or more restriction 
enzymes, at least one of which cuts at a simple sequence repeat, are used 
to cleave DNA into fragments that are ligated to adapters. The fragments 
are amplified with primers complementary to the adapter sequence. In 

is common with RAPD, several loci can be evaluated concomitantly by this 
method, but detection of polymorphisms requires observation of variation in 
band patterns by gel-electrophoresis and is subject to errors of 
superimposition of different alleles of similar electrophoretic mobility. The 
ability to score bands on an AFLP fingerprint is compromised by generation 

20 of large numbers of bands of which some may be very faint and difficult to 
interpret. Furthermore, the technique is prone to errors that are common to 
all PCR based techniques, summarised above, and suffers from an inability 
to analyse multiple complex genomes simultaneously. This is compounded 
by the generation of bands, by incomplete restriction of the template DNA, 

25 that do not reflect true polymorphisms. AFLP and RAPD analyses 

therefore share many of the same limitations. An additional problem is that 
AFLPs, rather than being evenly dispersed through out the genome, are 
reported to be clustered around centromeres. Consequently, this method 
may not allow the generation of polymorphisms that co-segregate with 

30 sequence differences of interest if they are located at a distance from 
centromeres. This problem is reflected in the reduced rate of 
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polymorphism detection compared to techniques such as linkage analysis. 
Furthermore, the complexity of the experimental data derived by AFLP 
becomes exaggerated with increasing complexity of the genome subject to 
analysis. Consequently, although it has been possible to investigate the 
5 genomes of some plant species by AFLP analysis, the relatively complex 
genomes of higher eukaryotic species may be beyond the useful capacity 
of this technique. 

RDA 

io RDA involves restriction endonuclease digestion of DNA, 

ligation of the fragments to adapters and amplification by PGR. Differences 
between compared genomes are selected by successive rounds of 
subtractive hybridization and kinetic enrichment such that regions of 
difference predominate. This technique is prone to erroneous results 

15 through reaction contamination and generation of spurious products. In 
addition, a fundamental requirement of RDA is the availability of families of 
closely related individuals, some of which are manifesting the trait of 
interest. Where RDA is performed on anything other than closely related or 
highly inbred genomes the multiplicity of differences is too vast for succinct 

20 and useful analysis. 

GMS 

GMS is technique for mapping regions of identity-by-descent 
of two related individuals. The entire genome is compared in a single 

25 hybridisation that has a high demand for DNA since the genomic samples 
are not amplified. Freedom from the need of prior map information, 
conventional markers, oV gel electrophoresis are to its advantage. 
However, the method is; restricted to use on the genomes of only two 
related individuals. " 

30 Restrictiori fragments of the two genomes are hybridised, one 

of which having been methylated such that heterohybrid molecules can be 
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distinguished through their resistance to digestion by Dpn I and Mbo I that 
cleave only fully methylated and unmethylated molecules, respectively. 
Heterohybrids containing homologous strands that lack mis-matches are 
selected and used to probe an array of mapped clones. Although the mis- 

5 match proteins used in this technique may resolve point mutations 

polymorphisms comprising more substantial mis-matches that are beyond 
the limit of this system are not detected. Therefore, in keeping with RFLP, 
AFLP, RAPD, and RDA, GMS tends to resolve binary polymorphisms that 
may have low informative power. 

io In all of the above techniques it is essential that there is a 

difference in nucleotide sequence at or between primer binding sites or 
endonuclease restriction sites in order to detect polymorphisms. This 
highlights the major limitations of these procedures, because in many 
instances a mutation giving rise to a hereditary trait will not create a 

is sequence difference detectable by variation in primer binding or restriction 
enzyme digestion. Consequently, a polymorphism linked to a trait of 
interest will not be identified using these techniques. GMS detects 
polymorphisms that are incidental to the restriction site and is spared some 
of the limitations of the other methods. However, in contrast to VNTR 

20 polymorphisms, the majority of polymorphisms detected by all of these 
techniques are not informative. 

Linkage analysis 

Linkage analysis is an indirect molecular genetic strategy that 
25 involves the systematic comparison of the inheritance of polymorphic 
VNTRs with the trait of interest in families in which that trait is present. 
There are a number of types of VNTR, including minisatellites and 
microsatellites, a feature of all being the repetition of elements of simple 
sequences. They are polymorphic by virtue of variation in the number of 
30 times each element is repeated, giving rise to alleles with variation in 
length. Since several alternative alleles may exist at any one locus, in 
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contrast to polymorphisms based on variation in primer binding or 
restriction enzyme digestion, VNTR polymorphic alleles tend to be highly 
informative. Consequently, where co-segregation of a trait with a particular 
VNTR allele is demonstrated, the allele may be used as a marker for that 

5 trait, or may be used as a vehicle to facilitate identification of the molecular 
genetic basis of the trait. Microsatellites are ubiquitously distributed 
throughout all eukaryotic genomes. Consequently, linkage analysis with 
microsatellites is associated with the highest polymorphism detection rate 
of the genetic screening methods. Indeed, systematic microsatellite 

10 analyses have already been responsible for many advances in the 
understanding of certain types of common cancer. Linkage analysis 
therefore has advantages compared to other related methods of difference 
analysis, the results of which are very reproducible. However, linkage 
analysis is very time consuming, labour intensive and expensive. 

is Furthermore, since many analyses are performed individually the overall 
requirement for DNA is extremely high. This is particularly true if a physical 
map of the genome is unavailable for the selection of informative 
microsatellites that are evenly distributed throughout the genome. The 
demonstration of linkage requires the application of elaborate statistical 

20 programs and powerful computer software for analysis of the experimental 
data. This technique is better suited to monogenic defects since the 
statistical analyses required for multigenic traits are particularly complex. 
Unfortunately, multifactorial genetic traits are far more prevalent than 
monogenic defects, making linkage analysis a cumbersome technique for 

25 the investigation of the majority of hereditary traits. 

The characteristics of an ideal protocol for isolation of 
polymorphisms co-segregating with disease in complex genomes would 
include: 

(i) the ability tb isolate simultaneously and with fidelity the 
30 polymorphisms from complex genomes of several individuals 

(ii) the ability to isolate several polymorphisms simultaneously, 
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permitting the analysis of polygenic traits 

(iii) a high detection rate of polymorphisms that co-segregate with 

sequence differences in all eukaryotic species, including subtle differences 
such as those resulting from point mutations 
5 (iv) no requirement for large families of closely related individuals 

to study traits of interest 

(v) no requirement for physical maps of the genome or prior 
knowledge of genomic sequence 

(vi) a requirement for sparing quantities of nucleic acid samples 
10 for analysis 

(vii) simplicity of use without a need for expensive specialist 
laboratory equipment or computer software 

(viii) potential for widespread application throughout the animal 
and plant kingdoms 

15 (ix) a robust performance with precision, accuracy and fidelity. 

None of the techniques that are currently available fulfil the 
majority of these ideal characteristics. All are compromised by at least one 
of several limitations including: expense; lack of speed; requirement for 
large amounts of DNA; low polymorphism detection rate; an inability to 

20 detect small sequence variations such as point mutations; a lack of fidelity 
with high incidence of artefacts and spurious results; inability to analyse 
several complex genomes concomitantly; an inability to resolve 
simultaneously polymorphisms at multiple loci; an intrinsic need for closely 
related genomes for analysis; a need for prior knowledge of sequence; and 

25 complexity of analysis with a need for expensive equipment and computer 
software. In addition, those techniques that are reliant on large families of 
closely related individuals are further compromised where there are 
discrepancies in lineage, so that paternity testing may be an essential 
preliminary investigation to establish the integrity of each family individual 

30 subject to analysis. 
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The Inventi n 

The invention is a novel method for generating en masse the 
VNTRs from genomic or synthetic DNA, while preserving each allele with 
its flanking sequence. These alleles may be used to produce a 'fingerprint 1 

5 by gel electrophoresis, or they may be used as the starting material in 
protocols for genotyping individuals or protocols for isolation of polymorphic 
markers that co-segregate with hereditary traits. The latter may be 
achieved by mis-match discrimination to yield a pool of alleles that are 
common to all individuals manifesting a particular trait. Further mis-match 

10 discrimination of these selected alleles with the alleles of individuals in 
which the trait is not present, in solution or fixed to an array, allows 
purification of VNTRs with alleles that are both linked and informative for 
the particular trait. The end products, therefore, are designated a Iotal 
Representation of Alleles Informative for a Irait (TRAIT). 

is In one aspect the invention provides a method of making a 

mixture of VNTR alleles and their flanking regions of the genomic DNA of 
one or more members of a species of interest, which method comprises the 
steps of: 

a) dividing genomic DNA of the species of interest into 
20 fragments, 

b) ligating to each end of each fragment an adapter thereby 
forming a mixture of adapter-terminated fragments in which each 3-end is 
blocked to prevent enzymatic chain extension, 

c) using a portion of the mixture of adapter-terminated 

25 fragments as templates with an adapter primer and a VNTR primer to 
create a mixture of 5-flanking VNTR amplimers, 

d) using a portion of the mixture of adapter-terminated 
fragments as templates with an adapter primer and a VNTR antisense 
primer to create a mixture of 3'-flanking VNTR amplimers, 

30 e) and using genomic DNA of the one or more members of the 

species of interest as template with the mixture of 5-flanking VNTR 
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amplimers and the mixture of 3-fianking VNTR amplimers as primers to 
make the desired mixture of VNTR alleles and their flanking regions. 

The species of interest may be any eukaryotic species from 
the plant and animal kingdoms. Although they do not show repetitive 

5 sequences in quite the same way, prokaryotic species are also envisaged. 
An individual member of a species may be for example a plant or a micro- 
organism or an animal such as a mammal. 

In another aspect the invention provides a portion of genomic 
DNA of one or more members of a species of interest, said portion 

10 consisting essentially of a representative mixture of alleles of a chosen 
VNTR sequence and their flanking regions. 

The term "representative mixture of alleles" does not 
necessarily imply that all of the possible alleles, or even most of these 
possible alleles, of a chosen VNTR sequence are present. Whether a 

15 particular allele is present or not, e.g. in the mixture generated by the 
method defined above, may depend on the nature of a restriction enzyme 
used in step a) and on other factors. 

The invention also provides a portion of genomic DNA of a 
species of interest, said portion consisting essentially of a representative 

20 mixture of 3-flanking regions of a chosen VNTR sequence, each member 
of the mixture carrying an adapter at its 3-end. 

The invention also provides a portion of genomic DNA of a 
species of interest, said portion consisting essentially of a representative 
mixture of 5-flanking regions of a chosen VNTR sequence, each member 

25 of the mixture carrying an adapter at its 5 r -end. 

The invention also provides a method of treating a mixture of 
polymorphic alleles, e.g. of a chosen VNTR sequence and their flanking 
regions, or alternatively a mixture generated in some other way such as 
AFLP, microsatellite-AFLP, GMS or RAPD, the mixture being 

30 representative of those which manifest a trait of interest, which method 
comprises separating and then re-annealing strands of the mixture, and 
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separating and discarding any mis-matches. Preferably the method 
comprises the additional step of hybridizing the said mixture with a mixture 
of corresponding polymorphic alleles, e.g. of the chosen VNTR sequence 
and their flanking regions, or alternatively a mixture generated in some 
5 other way such as AFLP, microsatellite-AFLP, GMS or RAPD, which are 
representative of those which do not show the trait of interest, and selecting 
mis-matches to provide a mixture of polymorphic alleles which are 
characteristic of the trait of interest. 

The invention also provides kits comprising protocols and 
10 reagents for performing the methods herein described. 

The salient points of the invention may be represented as 

follows: 

(i) reduction in the complexity of the genome by double positive 
selection of genomic DNA restriction fragments that both ligate to a chosen 

is adapter and contain a sequence with homology to a chosen primer, 

employing enrichment of such products by PCR, NASBA or other methods; 

(ii) introduction of the selected enriched fragments to a genomic 
template in such a way that allows recreation of the VNTRs with the 
flanking sequences within that template, whilst preserving the allele and 

20 therefore the informativeness of each locus; 

(iii) mis-match discrimination of the generated VNTR alleles to 
remove any spurious products of amplification that occur through miss 
priming events, reaction contamination, and subtle variation in reaction 
conditions; 

25 (iv) selection of only those synthesised VNTRs alleles that are 

common to all individuals manifesting a particular trait or those alleles that 
predominate in such a group of individuals. This is achieved by strand 
dissociation and hybridization, giving rise to mis-match containing 
heteroduplexes of alleles at any locus that differ among the individuals. 

30 These complexes can be rejected by mis-match discrimination. The 
enriched alleles that are common to individuals manifesting the trait or 
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predominate in that group are sufficiently pure to be used as starting 
material in other DNA based studies that utilise polymorphic alleles; 
(v) rejection of those alleles common to all individuals 

manifesting a particular trait or predominating in such a group that are also 

5 common to individuals in which the trait is not present. This is achieved by 
strand dissociation and hybridization of the VNTR alleles that are common 
to individuals manifesting a particular trait of interest or predominating in 
that group with the VNTR alleles of individuals in which the trait is not 
present followed by a further round of mis-match discrimination. In this 

10 case mis-match containing heteroduplexes and homoduplexes derived 
from the individuals manifesting the hereditary trait are selected. These 
represent polymorphic VNTRs with an informative allele that co-segregates 
with the particular trait of interest. Amplification of these VNTRs from the 
DNAs of individuals manifesting the trait of interest yields the informative 

is alleles that may be used as DNA markers. 

The invention provides a method of selecting genetic 
elements that are common to one pool of individuals but are absent in a 
second or present at a lower level. An obvious variation on this theme is 
the selection of genetic elements that are absent in one pool of individuals 

20 but are present in a second by judicious selection, during the course of the 
procedure, of allele duplexes that are either with or without a mis-match. 

For simplicity, the protocol may be considered in three 
separate sections: generation of VNTR alleles; mis-match discrimination; 
and selection of alleles informative for a trait. The text is illustrated with a 

25 number of diagrams to facilitate description of the invention. 

Generation of VNTR alleles 

The protocol describes a method of generating with fidelity 
the VNTR alleles with their flanking sequences en masse from the genomic 
30 DNA of one individual, or the pooled DNAs of several individuals. The 
initial step involves fragmentation of the genomic DNA physically, 
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chemically or enzymatically, the aim of which is to obtain genomic 
fragments that contain VNTRs all of which being of an amplifiable length. 
The use of one or more restriction enzymes gives rise to uniform 
fragmentation of the genomic sample and constitutes the preferred 

5 technique. With judicious choice of restriction enzymes that cut frequently 
there is potential for generation en masse of every VNTR of the chosen 
type within a genome or pool of genomes since virtually all fragments will 
be sufficiently small for efficient amplification. It should be noted that the 
phenotype of individuals contributing genomic DNA for this fragmentation is 

10 unimportant. Indeed, the genomes restricted in this way need not be 
derived from any individual, or pool of individuals, that have been selected 
by virtue of their phenotype for investigation of a particular trait of interest. 



Genomic DNA of one or more individuals of the species of interest containing VNTRs ( ^|||gg|j) 

| Restriction enzyme(s) 



15 

The restriction fragments are ligated to an adapter by which 
the fragments may be amplified or manipulated. The sequence of the 
longer oligonucleotide contained within the adapter is chosen such that it 
fails to generate any products when added as the primer to an amplification 

20 reaction containing genomic DNA as template. Termini are introduced 
physically, chemically or enzymatically to all available 3' ends to prevent 
their extension under the influence of a DNA polymerase. They may be 
introduced in one of several ways including: (A) addition of the terminus 
prior to ligation; (B) addition of the terminus following ligation; (C) addition 

25 of the terminus during ligation. The spectrum of available termini that are 
suitable for this purpose include, but are not limited to, dideoxynucleotide 
triphosphates. 
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(A) A method by which termination may be achieved of all 3' ends 

with dideoxynucleotide triphosphates prior to ligation is through the action 
of a DNA polymerase, including Terminal deoxynucleotidyl transferase, in 
the presence of a chosen dideoxynucleotide triphosphate. 
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1\ Terminal deoxynucleotidyl 
transferase and a ddNTP 
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Ligation then follows with an adapter containing an 
appropriate 5' recess that accommodates the dideoxynucleotide 
10 triphosphate terminus on each strand. 
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T4 DNA ligase 
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(B) A method by which termination may be achieved of all 3' ends 

15 with dideoxynucleotide triphosphates following ligation is through the action 
of a DNA polymerase in the presence of a chosen dideoxynucleotide 
triphosphate. 
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T4 DNA ligase 



DNA polymerase + a ddNTP 



W/////////. 



(C) A method by which the ligated 3' ends can achieve 

termination during the ligation process is through incorporation of a suitable 

5 3' terminus and a 5' phosphate on the shorter oligonucleotide during its 
synthesis such that this oligonucleotide will form a covalent bond with the 
genomic fragments under the influence of a enzyme such as T4 DNA 
ligase. Again, suitable termini include but are not limited to 
dideoxynucleotide phosphates, there being a variety of other modifications 

10 and deoxynucleotide analogues that will prevent extension of the 3' ends 
under the influence of a DNA polymerase. 



DNA ligase 



drffeaaiftl V/J///////A 



15 

Of these, method (A) was found to be the most reliable since 
every genomic fragment that achieves ligation to an adapter is guaranteed 
to have an appropriate terminus. In addition, it guarantees that inter- 
fragm nt ligation is impossible. Method (C) also guarantees that each 
20 ligated 3' end possesses a terminus. However, unlike in the case of 
method (A), inter-fragment ligation can occur. 
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Since it is likely that some fragments will contain sites at 
which one DNA strand is nicked, in order to prevent polymerisation from 
these sites it is preferable to incorporate into them suitable termini. This 
may be achieved in a number of ways including, but not limited to, the 

5 incubation of the terminated and ligated genomic fragments with a DNA 
polymerase in the presence of all dideoxynucleotide triphosphates. 

The longer oligonucleotide that is contained within the 
adapter may be used as the adapter primer in amplification reactions 
containing the genomic fragments that have been appropriately ligated and 

10 blocked by addition of termini at all potential sites of polymerisation. 
However, in the absence of 'internal 1 priming from another nucleotide 
sequence, the amplification of DNA is impossible. However, if another 
nucleotide sequence successfully anneals and achieves polymerisation to 
the limit of the adapter, an adapter primer binding site is created. Binding 

15 of the adapter primer will allow polymerisation of DNA to the limit of the 
annealed nucleotide sequence. If the nucleotide sequence represents a 
primer, or represents a nucleotide sequence containing a primer binding 
site, introduction of the adapter primer and the Internal primer* allows 
specific exponential amplification of products only from those fragments 

20 that successfully ligated to the adapter and contain DNA homologous to 
that of the annealed nucleotide sequence. 

If an oligonucleotide with sequence homology to a chosen 
VNTR is used as the internal primer, only those fragments that have ligated 
successfully to the adapter and contain the targeted VNTR will be capable 

25 of amplification. This gives rise to 'amplifiers' that flank each VNTR, 
comprising genomic sequence limited by a restriction site for the chosen 
restriction enzyme and VNTR sequence with homology to the chosen 
VNTR primer. 
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Annealing of a VNTR primer ( m& ) 



fcK^KM I dd 



dd l - V/////////A - 



Extension of the annealed primer 
by a DNA polymerase creates 
an adapter primer binding site 



Introduction of the adapter primer 
( \wmmmm ) and amplification 
generates amptimers that 
constitute one flank of all VNTRs 



A number of different types of VNTR sequence have been 
identified in a diverse range of species. These include, amongst others, 
the dinucleotide repeats, trinucleotide repeats and the tetranucleotide 

10 repeats. Since the (AC)n dinucleotide repeat constitutes the most common 
VNTR that occurs in the majority of species, primers of appropriate 
sequence to generate amplimers for this VNTR may be chosen. It can be 
seen that the introduction of an (AC)n primer will give rise to amplimers that 
represent one flank of the VNTRs, and introduction of a (GT)n primer will 

1 5 give rise to amplimers that represent the other flank of these VNTRs. 
However, VNTRs with long repeat lengths will be over represented in the 
amplimer pool relative to shorter VNTRs by virtue of their greater number of 
primer binding sites. Similarly, the longer alleles will be over represented 
relative to the shorter alleles of the same VNTR due to their greater number 

20 of primer binding sites. This problem is negated by the introduction of 
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degenerate 3' ends on the VNTR primers that prevent polymerisation of the 
annealed primers unless they are aligned with the start of the flanking 
sequence. The amplification of all VNTRs and all alleles, therefore, will not 
be biased by their repeat lengths. In the case of (AC)n dinucleotide 
5 repeats the following primers may be used: 

(AC)nB, where B = C + G + T 
(CA)nD, where D = A + G + T 
(GT)nH, where H = A + C + T 
io (TG)nV, where V = A + C + G 

Alternatively, amplimers of other VNTR sequences may be 
generated in this manner by introduction of the appropriate target-specific 
primer containing a degenerate 3' end. Indeed, amplimers constituting 

is genomic sequence that contain or flank any target-specific nucleotide 
binding site may be generated in the same way. 

In the case of (AC)n dinucleotide repeats, the amplimers 
derived from reactions primed by the (AC)nB and (CA)nD degenerate 
oligonucleotides may be pooled. An obvious alternative is to generate an 

20 amplimer pool by priming amplification reactions with the (AC)nB and 

(CA)nD degenerate oligonucleotides together. However, this is likely to be 
less efficient than performing the reactions separately. Similarly, the 
(GT)nH and (TG)nV primed reactions may be pooled, or reactions 
containing both of these degenerate primers may be performed. Thus, two 

25 amplimer pools may be created, each representing sequences from only 
one flank of each VNTR. 



Amplimers constituting the 5' f tank of al! VNTRs Ampfimers constituting the J flank of all VNTRs 
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Since only one of the two flanking sequences of all VNTRs is 
generated in each amplimer pool, the full allele length being absent, the 
products of amplification are non-informative. However, the full length 
alleles, together with their flanking sequences, can be recreated with fidelity 

5 en masse from genomic DNA by hybridisation of the amplimers to that 
genomic DNA and subsequent polymerisation of the annealed sequences. 
As such, the full length 'affected 1 VNTR alleles of individuals manifesting a 
particular trait of interest may be obtained by hybridisation of the amplimers 
to the genomic DNAs of those individuals. Similarly, the reciprocal reaction 

10 for individuals in which that trait is absent will give rise to the generation of 
full length 'wild type' VNTR alleles and flanking sequences as they occur in 
the genomes of those individuals. Thus, two pools of VNTRs can be 
generated containing alleles derived from 'affected' DNA and alleles 
derived from 'wild type' DNA. A DNA polymerase that is highly processive 

15 is preferred in this application in order to minimise the potential for 
generation of 'stutter bands' that result from strand slippage during 
polymerisation. 

To limit the potential for generation of spurious products by 
'cross-talk' that occurs through the non-specific association of amplimer 

20 strands during hybridisation, it is preferable to remove the VNTR repeat 
sequences from the amplimers since these repeat sequences will be 
responsible for the majority of such cross-talk. This may be initiated in a 
number of ways including, but not limited to, (A) digestion by an enzyme 
with 3' to 5' exonuclease activity; (B) digestion by an enzyme with 5' to 3' 

25 exonuclease activity ; (C) digestion by Uracil DNA glycosylase of an 
amplimer pool generated with primers containing uracil; (D) digestion by 
RNase of an amplimer pool generated with an RNA primer. 
(A) Providing the 5' end of the adapter primer has all four 

nucleotides represented the opposing strand will be similarly endowed. As 

30 such, incubation with an e'nzyme with 3' to 5' exonuclease activity, such as 
T4 DNA polymerase at 12>C in the presence of only two deoxynucleotide 
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triphosphates, will not lead to significant shortening of the 3' strand 
complementing the adapter primer. The 3 r strand complementing the 
VNTR primer, however, will be removed by T4 DNA polymerase if the 
reaction occurs in the presence of the deoxynucleotides that it lacks. 

5 Exonuclease digestion by the enzyme will cease when the first 

deoxynucleotide that is present in the reaction mixture is encountered. The 
5' overhang that is created may be digested with a single strand specific 
exonuclease or endonuclease, including but not limited to Exonuclease VII, 
such that all repeat sequence is removed. The illustration depicts a 

10 scenario for (AC)n and (GT)n primed amplimers: 



□ <AC)n 3' 5' (AC)nE 
-B(TG)n 5* 3* (TG)nC 



IT4 DNA polymerase 
+ dGTP + dTTP 



T4 DNA polymerase 
+ dATP + dCTP t 



3(TG)n 



5' (AC)n C 



Exonuclease VII 



Exonuclease VII 



is If a trinucleotide VNTR has been targeted appropriate 

digestion by T4 DNA polymerase in the presence of only one 
deoxynucleotide will be required. For tetranucleotide repeats this method 
is inappropriate and another should be adopted. 
(B) The repeat sequence may be digested with a 5' to 3' 

20 exonuclease, such as T7 gene 6 exonuclease. Phosphorothioate bonds 
retard the activity of this enzyme. Four successive bonds are believed to 
inhibitory. Therefore, if the adapter primer has been synthesised with at 
least four phosphorothioate bonds at its 5' end, if not synthesised 
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compietely with phosphorothioate bonds, it will be resistant to the 5' to 3' 
exonuclease activity of T7 gene 6 exonuclease. If the VNTR primers are 
synthesised with four phosphorothioate bonds at their 3' ends, the action of 
T7 gene 6 exonuclease will digest the VNTR primer leaving four 

5 nucleotides of repeat sequence. The complementary sequence may be 
digested by a single strand specific exonuclease orendonuclease, 
including but not limited to Exonuclease I, such that all repeat sequence is 
removed from the amplimers apart from four nucleotides in each strand. 
Such a short length of repeat sequence is unlikely to invite the generation 

10 of spurious products by non-specific interaction of strand ends during 
hybridisation. 



ail % 



T7 gene 6 
exonuclease 



T7 gene 6 
exonuclease 



3* VZZS1 



17 gene 6 
exonuclease 



5' ^1 
31 V77?T 



T7 gene 6 
exonuclease 



(C) Synthesis of uracil containing VNTR primers, e.g. (GU)nH 

15 and (UG)nV, allows the destruction of these primers in the appropriate 
amplimer pool by the action of Uracil DNA glycosylase. Incubation of the 
digested amplimers with a single strand specific endonuclease, including 
but not limited to S1 nuclease, leads to further digestion of the VNTR 
primers that contains single stranded spaces and ultimately to the removal 
20 of the complementary sequence such that all repeat sequence is removed. 



WO 98/42867 



PCT/GB98/00840 



-24- 



](AC)n 3' 
3(UG)n 5' 



Uracil DNA glycosytase 

n (AC)n 3' 
IZD( G)n 5' 



S1 nuclease 



(D) The generation of amplimer pools with RNA primers based on 

VNTR sequence, using a DNA polymerase with reverse transcriptase 

5 activity, permits the destruction of the VNTR primers by the action of 

RNAse. The complementary sequence may be removed by a single strand 
specific exonuclease or endonuclease. 

There are several methods by which the digested amplimers 
may be hybridised to the genomic DNA of one or more individuals to 

io generate en masse and with fidelity the VNTR alleles as they occur in that 
template. These include (A) hybridisation and polymerisation of the 
amplimer pools, either separately in succession or together to genomic 
DNA that may or may not have been fragmented; (B) hybridisation and 
polymerisation of the amplimers constituting only one flank of each VNTR 

is to genomic DNA that has been fragmented physically, chemically or 

enzymatically, and then terminated and ligated to an adapter which may or 
may not be the one used to generate the amplimer pools. In each case, 
the addition of one of many hybridisation accelerators will enhance the rate 
of hybridisation. Particularly under stringent conditions of hybridisation the 

20 use of such accelerators may be preferable. The number of methods by 
which hybridisation may be accelerated is vast but includes the 
incorporation of phenol exclusion, cationic detergents such as cetyl 
trimethylammonium bromide (CTAB), and volume excluding agents such 



WO 98/42867 



PCT/GB98/00840 



25- 



as dextran sulphate. It should be noted that if CTAB is the chosen 
hybridisation accelerator the salt concentrations in the hybridisation mixture 
should be low in order to prevent its precipitation. 
(A) Illustration is given for hybridisation of one amplimer pool to 

genomic DNA to permit the reproduction of VNTR alleles in that genomic 
template by a DNA polymerase: 



i 



Hybridisation of the 5' VNTR amplimers to 
genomic DNA of one or more individuals 



V//MWA 



Extension of the annealed 3' end 
by DNA polymerase 



10 



Hybridisation of the second amplimer pool permits 
amplification of all VNTR alleles en masse using the adapter primer: 
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^"^5^ 1 













Melting and cooling allows the 3' 
flanking amplimerto anneal to the 
extended strand 



3m 



The VNTR allele and opposing 
flanking sequence is copied by 
DNA polymerase 



3 *C=^ IF ^ ff 



gfcftMl I 13* 



Amplification of VNTRsfrom the 
genomic DNA under investigation on 
introduction of the adapter primer and 
thermal cycling 
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(B) Illustration is given for hybridisation of one amplimer pool to 

genomic DNA that has been fragmented, terminated and ligated to an 
adapter that may or may not be the same as that as that present in the 
amplimer pools: 



ddC 



□dd 



3^ 



Hybridisation of the 5" VNTR 
amplimers to genomic DNA 
of one or more individuals 



isms 1 



□dd 



Extension of the annealed 3 a end 
by ONA polymerase 



□dd 



ddC 



V/S//MM 



Amplification of the VNTRs from the 
genomic DNA under investigation 
on introduction of the adapter primer 
and thermal cycling 



Removal of repeat sequence from the amplimers permits 
io concomitant hybridisation of both amplimer pools to genomic DNA while 
limiting the possibility for generation of spurious products through non- 
specific strand association. The generation of spurious products is reduced 
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further by hybridising the amplimers that constitute each flank separately in 
succession. This allows the introduction of further steps to control non- 
specific strand association including the removal of non-hybridised strands 
by incubation with a single strand specific exonuclease or endonuclease 

5 between hybridisations. In the preferred technique only one amplimer pool, 
comprising one flank of each VNTR, is hybridised to terminated and 
adapter-ligated genomic fragments. As such, this negates any possibility of 
non-specific association between amplimer strands of different pools. If 
each amplimer pool is hybridised and polymerised separately in this 

10 manner, the products that are generated in each reaction should be 
identical. Therefore, these products may be combined. 

Hybridisation of the amplimers to the pooled genomes of 
several individuals allows the generation of the VNTR alleles that they 
contain. If this is performed on the pooled genomes of individuals 

15 manifesting a particular trait, and also on those of individuals lacking the 
trait r the Effected 1 and 'wild type 1 alleles that are present in those pooled 
genomes can be synthesised. 

It is preferable to select the affected individuals from a 
defined population such that the same genotype is common to all 

20 individuals of a given phenotype. However, even if these individuals are 
selected from an out-bred population for which there are several genotypes 
that produce a single phenotype, the alleles that co-segregate with the trait 
loci will be present at a higher frequency in the pooled genomes of affected 
individuals than in the reciprocal pooled genomes of wild type individuals. 

25 These alleles will be enriched by successive repetitions of mis-match 
cleavage and amplification. To prevent the allele frequencies from being 
artificially skewed it is preferable to have a large number of individuals 
contributing genomic DNA to each pool. This ensures that the allele 
frequencies in the affected group and wild type group tend to equate to the 

30 general population from which they are derived such that disparity in the 
two is a consequence of linkage disequilibrium with the trait and not 
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another factor. However, if the numbers of affected and wild type 
individuals is limited the selection of matched sibling pairs, one member of 
each pair being affected and the other being a wild type individual, will go 
some distance to balance the allele frequencies of the pooled genomes 
5 other than with respect to the particular trait. 

Mis-match discrimination 

If the VNTR alleles that are generated from the affected 
individuals and the wild type individuals are denatured and allowed to re- 

10 anneal in separate reactions duplex DNA molecules with or without mis- 
matches will result. Due to the VNTR-specific flanking sequences and 
stringent conditions of hybridisation, only alleles that are of the same VNTR 
will re-anneal. Therefore, duplexes possessing mis-matches contain 
alleles of the same VNTR that are of unequal size or they contain spurious 

15 products of amplification. Alleles of similar size that re-anneal will form 
perfect duplexes. 

The molecules that contain a mis-match may be digested with 
an enzyme that acts upon single stranded DNA or an enzyme that is able 
to detect conformational irregularities in DNA. Suitable enzymes include 

20 but are not limited to S1 nuclease and T4 endonuclease VII. 
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Alleles shared by all individuals Alleles that differ among individuals 



Denature and re-anneal 



T4 endonuclease VII 



I 1 V///////A mum 



Of these two enzymes, T4 endonuclease VII has proved to be 
the most reliable and efficient enzyme in this application and has been 
found to digest efficiently in a range of DNA polymerase buffers while 
tolerating carry-over of CTAB from the hybridisation reaction, It cleaves 

10 both strands of a mis-match containing molecule leaving staggered ends, 
each strand being cleaved 3' with respect to the mis-match. 

Cleavage is likfely to occur within the repeat sequence 
creating ends that may interjact non-specificaliy during the subsequent 
amplification process and resulting in the generation of spurious products. 

15 To obviate this problem the repeat sequences may be digested from the 
cleaved duplexes. This may be achieved in a number of ways, including 
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(A) by the action of a 3* to 5' exonuclease including but not limited to 
Exonuclease III, together with a single strand specific exonuclease or 
endonuclease, having protected all DNA strands prior to T4 endonuclease 
VII digestion with protective termini including but not limited to a- 

5 thiophosphate groups or a 3' overhang; (B) by the action of a 5' to 3' 
exonuclease including but not limited to T7 gene 6 exonuclease, together 
with an exonuclease or endonuclease, having protected all DNA strands 
prior to T4 endonuclease VII digestion with protective groups including but 
not limited to phosphorothioate bonds incorporated in to the adapter primer. 

io By inclusion of phosphorothioate bonds in the adapter primer 

the 5 ! ends of all molecules containing the adapter primer will be resistant 
to the 5' to 3' exonuclease activity of T7 gene 6 exonuclease. However, the 
5' ends created by T4 endonuclease VII cleavage will be susceptible to this 
enzyme. 




17 gene 6 exonuclease + single strand 
specific exonuclease or endonuclease 



I 1 V/JJ////A ~EB$?m 



Amplification 
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It is possible that some molecules will escape complete 
cleavage by T4 endonuclease VII acquiring merely a single stranded nick. 
However, such nicks are susceptible to digestion by T7 gene 6 
exonuclease, though only the nicked strand would be digested if this 

5 enzyme was used in concert was a single strand specific exonuclease. On 
the other hand, a single strand specific endonuclease, including but not 
limited to S1 nuclease, would cleave the complementary single strand that 
is exposed by action of T7 gene 6 exonuclease in molecules receiving 
single stranded nicks such that both strands become disrupted. Thus, 

10 enzymes such as S1 nuclease in concert with T7 gene 6 exonuclease 
would lead to the complete digestion of all T4 endonuclease VII digested 
molecules irrespective of whether one or both strands was cut. 

S1 nuclease has proven successful in this role, being capable 
of efficient digestion of single stranded DNA under alkaline conditions 

is created by the 77 gene 6 exonuclease buffer. However, some non-specific 
digestion of DNA may occur with this enzyme. Since those molecules 
receiving single stranded nicks by the action of T4 endonuclease VII are 
likely to be few, it may be preferable to use a single strand specific 
exonuclease that is less likely to act in this way. Among such enzymes are 

20 included Exonuclease I and Exonuclease VII. Molecules that lack a mis- 
match are resistant to this regime of digestion and may be enriched by 
amplification. In order to minimise the generation of 'stutter bands' that 
result from strand slippage and polymerase errors during the amplification 
reaction, the number of cycles of amplification should not exceed that 

25 which gives adequate yields of product. 

In addition to T7 gene 6 exonuclease, Exonuclease III may 
act at nicks in DNA molecules. In the absence of phosphorothioate bonds 
within the adapter primer this enzyme would create long 3' overhangs in 
nicked molecules on digestion to completion. Therefore, inclusion of a 

30 single strand specific endonuclease or exonuclease that would remove 
these overhangs would allow the elimination of the cleaved molecule 
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irrespective of whether T4 endonuclease VII disrupted one or both strands 
in a mis-match containing duplex. However, in order to obviate the need 
for the additional step comprising protection of the 3' ends of all DNA 
molecules prior to mis-match cleavage the use of T7 gene 6 exonuclease is 

5 preferred since protection of the 5' ends that is required for use of this 
enzyme is easily achieved by incorporation of phosphorothioate bonds into 
the adapter primer. 

Another method by which cleaved molecules could be 
removed is by addition of a hapten, including but not limited to biotin-16- 

10 dUTP, at the sites of cleavage followed by physical separation of the 
cleaved molecules by the affinity of the hapten to another chemical. This 
could be achieved by termination of the 3' ends of all molecules prior to the 
mis-match cleavage procedure such that they are inert in the presence of a 
DNA polymerase. Suitable termini include but are not limited to 

15 dideoxynucleotide triphosphates which may be incorporated by a DNA 
polymerase including but not limited to Terminal deoxynucleotidyl 
transferase. Subsequent incubation of the cleaved molecules with biotin- 
16-dUTP in the presence of a DNA polymerase, such as Terminal 
deoxynucleotidyl transferase, will give rise to biotinylation of only those 

20 molecules which lack terminated 3' ends. Separation of the biotinylated 
molecules through binding to streptavidin could then follow. 

In a similar manner, since molecules cleaved by T4 
endonuclease VII have a 3' overhang these molecules could be removed 
through capture by single stranded binding proteins or chemicals that 

25 possess an affinity for single stranded DNA. It is likely that the overhang 
created by T4 endonuclease VII will be too small for efficient selection of 
the cleaved molecules by this method. However, they could be lengthened 
specifically by incubation with a DNA polymerase, including but not limited 
to Terminal deoxynucleotidyl transferase in the presence of one or more 

30 deoxynucleotide triphosphates, having terminated all 3' ends of the DNA 
molecules prior to mis-match cleavage with suitable termini that render 
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them inert in the presence of a DNA polymerase. 

Physical separation of DNA molecules is cumbersome and 
relatively inefficient compared to separation by enzymatic means. 
Furthermore, the removal of molecules that possess single stranded nicks 

5 is likely to be unsuccessful. For these reasons methods of enzymatic 
differentiation of DNA species is preferred. 

Reiteration of several rounds of denaturation, hybridisation 
and mis-match cleavage successfully eliminates all spurious products of 
amplification. Furthermore, it reduces to homozygosity all VNTRs such that 

10 only the most common allele of each VNTR remains, or it tends to eliminate 
those VNTRs for which many alleles are present with equal frequency. 
Rapid transition from the temperature of denaturation to that of annealing is 
required to prevent preferential annealing of identical sized alleles. This is 
may occur if the transition from the denaturation temperature to the 

15 annealing temperature is protracted. A hybridisation accelerator may be 
included to enhance the efficiency of hybridisation. This process carried 
out in parallel for the 'affected 1 VNTR alleles as well as the 'wild type 1 VNTR 
alleles will tend to achieve identical reduction to homozygosity and the 
generation of balanced allele frequencies. However, for a number of 

20 VNTRs the allele frequencies in the affected and wild type groups at the 
end of the mis-match cleavage procedure will be significantly different 
Providing that the trait of interest is the only feature distinguishing the two 
groups of individuals from which the VNTRs were derived alleles that are 
over represented in the affected group relative to the wild type group must 

25 co-segregate with that trait. These are markers of the trait and should be 
selected. 

The effect of reiterated mis-match cleavage on the allele 
frequencies of a VNTR can be illustrated with a basic scenario ignoring the 
efficiency of digestion, the effects of polymerase errors and the second 
30 order kinetics of hybridisation. Consider a VNTR for which three alleles are 
present as follows: 
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A 


B 


c 
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1/ 


1/ 
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4 


4 


2 


1 


1 



STARTING SCENARIO 
Alleles 

Allele frequency 
Ratio 



If the alleles are denatured and allowed to re-anneal duplex 
molecules with or without a mismatch will result. The proportion of each 
allele that forms a perfect duplex will depend on its allele frequency. All 
io mis-match containing molecules theoretically would be susceptible to 

digestion by T4 endonuclease VII and would be eliminated. Thus, after the 
first round of mis-match cleavage the amounts and ratios of each allele 
remaining would be: 

15 Alleles A B C 

Amount remaining 4/ i6 1/ ie 1 'ie 

Total remaining 6/ i6 

Ratio 4 1 1 



Allele frequency 4/ 6 1/ e V e 



After a second round of mis-match cleavage the allele 
frequencies would change further: 

Alleles ABC 

^ . . 16/ 1/ 1/ 

25 Amount remaining 36 36 36 

18/ 

Total remaining 36 

Ratio i 16 1 1 

. ., . , i 16/ 1/ 1/ 

Allele frequency ia is is 

I 

30 After the 3rd round the theoretical allele frequencies would be 

as follows: j 
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Allel s ABC 

Amount remaining 256/ 324 1/ 324 1/ 324 

Total remaining 258/ 324 

Ratio 256 1 1 

5 Allele frequency 256/ 25a 1/ 258 1/ 258 

Therefore, after two rounds one allele would predominate 
markedly. After a further round this allele would be present virtually 
exclusively. The ratio of the total amount of this VNTR remaining, relative 
10 to a VNTR for which there was only one allele prior to mis-match cleavage, 
would be: 

6/ ^Y 18/ v 258/ • 1/ V V V 1/ 

16 X 36 X 324 • 1 X 1 X 1 

-43/ . * 
288 - T 

15 In the same way the most common allele of any VNTR will 

predominate after a sufficient number of rounds of mis-match cleavage. 
Four rounds may be sufficient to reduce the VNTRs to near homozygosity, 
but the efficiency of enzyme digestion, the generation of polymerase errors 
and the kinetics of hybridisation are factors that will influence this. Disparity 

20 in the allele frequencies of affected and wild type VNTRs will lead to 

enrichment of different alleles in each group if the imbalance is sufficiently 
large. Such alleles are informative for the trait of interest but must be 
selected from other enriched alleles that may be identical in both the 
affected and wild type groups if these predominate in the population in 

25 general irrespective of the trait. 

Further examples of mis-match discrimination under different 
scenarios is given in the Appendix. 



30 



Selection of alleles informative for a trait 

Selection of the alleles linked to the trait of interest may be 
achieved in a number of ways. Disparity in the allele size of each VNTR 
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surviving successive rounds of the mis-match cleavage procedure may be 
identified by hybridisation of these alleles from each group of individuals to 
an array of VNTR alleles of known length and spatial separation such that 
differences can.be detected. Indeed, it may be possible to achieve 

5 quantitative hybridisation to an array in a similar manner that generates 
information regarding allele frequencies in the two groups without need of 
the mis-match cleavage procedure. 

A less elaborate procedure involves the subtraction of the 
alleles in one group from those in another to identify differences in allele 

10 frequencies. However, this method must identify not only a VNTR for 
which an allele is present in one group but no alleles survive in the other 
group, but also a VNTR for which the alleles surviving in each group are 
different since both of these scenarios suggest linkage disequilibrium with 
the trait of interest. This can be achieved physically, chemically or 

is enzymatically. If enzyme based selection is chosen it is preferable to 
amplify the alleles that have been enriched by the mis-match cleavage 
procedure with adapter primers that lack phosphorothioate bonds in order 
that enzyme digestion can proceed to completion. 

A suitable method of enzyme based selection involves the 

20 addition of protective termini, including but not limited to a 3 1 overhang of at 
least four nucleotides or an a-thiophosphate linkage, to the surviving alleles 
of one group of individuals and subtraction with an excess of those 
surviving from the other group using Exonuclease III. Under most 
circumstances identification is required of any allele surviving from the 

25 affected individuals that fails to survive from those individuals lacking that 
trait. For this, addition of the protective termini should added only to the 
VNTRs derived from affected individuals. Obviously, the alternative 
strategy is possible. A 3' overhang may be created in a number of ways 
including but not limited to (A) ligation of an adapter, or by (B) non-template 

30 addition of nucleotides by a DNA polymerase. Of these, method (B) was 
found to be the more efficient which may be achieved using an enzyme 
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such as Terminal deoxynucleotidyl transferase. This enzyme may 
generate a 3' overhang of several hundred nucleotides on incubation in the 
presence of a single deoxynucleotide triphosphate. An a-thiophosphate 
linkage may be incorporated by addition of a protective deoxynucleotide 

5 analogue using a DNA polymerase including but not limited to Terminal 
deoxynucleotidyl transferase. Suitable analogues include a-thio 
deoxynucleotide triphosphates. Since these analogues may inhibit 
subsequent digestion or manipulation of the DNA molecules the addition of 
a 3* overhang to impart protection is preferred. Another less preferred 

io method of imparting protection to the activity of Exonuclease III is through 
the action of an exonuclease with 5' to 3' activity, including but not limited 
to T7 gene 6 exonuclease, that may create a 5' recess in duplex DNA. The 
appropriate incorporation of phosphorothioate bonds within the adapter 
primer that is used to amplify the DNA molecules would ensure that 

15 digestion by T7 gene 6 exonuclease beyond that required to impart 

resistance to Exonuclease III is prevented. Similarly, a 5' recess could be 
created by incorporation of a uracil rich 5' end in the adapter primer which 
could be digested using an enzyme such Uracil DNA glycosylase. 



Ligation of alleles 
from the affected pool 
to a second adapter 



B 



dATP + Terminal 
deoxynucleotidyl 
transferase 



The resulting molecules are resistant to Exonuclease III 
digestion because of the; 3' overhang that is created. Hybridisation to an 
excess of the surviving wild type alleles ensures heteroduplex formation of 
25 all affected alleles providing an allele of the appropriate VNTR survives in 
the wild type group. 
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Hybridisation to an excess of 
alleles from the wild type pool 




If there are no wild type alleles to subtract from those of the 
5 affected group homoduplex molecules that possess a 3' overhang at each 
end will result (molecule 1). If the surviving allele of a VNTR differs 
between the two groups a heteroduplex molecule containing a mis-match 
will result (molecule 2). Surviving alleles of equal size in the two groups will 
give rise to heteroduplex molecules without a mis-match (molecule 3). The 
10 other species of DNA that will result from the hybridisation include 

homoduplexes of wild type alleles that may or may not contain a mis-match 
(molecule 4) and single stranded molecules that fail to hybridise. Digestion 
of these different types of molecule by an enzyme that acts on single 
stranded DNA or conformational irregularities in DNA, including but not 
15 limited to T4 endonuclease VII , results in cleavage of those duplexes 
containing a mis-match with the generation of a 3' overhang at the site of 
cleavage. 

j 

- i 

1 
1 

i 
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The subsequent digestion by Exonuclease III renders single 
5 stranded all duplexes or fragments of duplexes that do not possess a 3' 
overhang at each end. 




10 Since the digestion of susceptible molecules by Exonuclease 

III tends to go to completion further digestion with a single strand specific 
exonuclease or endonuclease eliminates all single stranded DNA species 
and removes the 3* overhang on the surviving molecules. Therefore, only 
the target molecules survive digestion. Exonuclease I is suited to this task 

15 but often leaves a single nucleotide 3' overhang that must be removed if 
blunt end cloning is chosen as the means by which the target molecules 
are recovered. 
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For the intact homoduplexes the informative allele is present 
within the homoduplex and may be identified by cloning and sequencing. 
For T4 endonuclease VII cleaved fragments that have survived digestion by 

5 Exonuclease III and Exonuclease I, the full length VNTRs can be obtained 
by hybridisation of the fragments to fragmented, terminated adapter-ligated 
genomic DNA followed by amplification in a similar manner to that 
previously described. The informative allele may be identified by 
genotyping the individuals manifesting the trait of interest with respect to 

10 these VNTRs using VNTR-specific primers designed from their flanking 
sequences. 

It is obvious that this method of subtraction is equally suited 
to other alleles besides those of VNTRs that may be generated in a variety 
of different ways. As such, this method of identifying differences in the 

15 composition of DNA pools may be applied more widely for selection of 
other types of polymorphic sequences as well as other species of DNA that 
may be present in one pool but absent in the same form in another. 

This method is unique in its suitability for investigation of 
polygenic as well as monogenic hereditary traits. It is likely to make a 

20 significant impact in the study of hereditary traits, reducing considerably the 
difficulty, time and expense that is currently associated with this field of 
research. 



The preferred embodiment 

25 (i) Fragmentation- of genomic DNA of an individual of the species 

under investigation, but not necessarily an individual in that investigation, 
with a single restriction enzyme. 
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(ii) Termination of all 3' ends by Terminal deoxynucleotidyl 
transferase in the presence of a dideoxynucleotide triphosphate. 

(iii) Ligation of the terminated fragments to an adapter by 
incubation in the presence of T4 DNA ligase, followed by termination of 

5 single-stranded nicks. 

(iv) Purification of the ligated products from the ddNTPs and 
amplification in reactions containing: 

a) adapter primer and an (AC)nB primer, where B=G+T+C; 

b) adapter primer and a (CA)nD primer, where D=G+A+T; 
10 c) adapter primer and a (GT)nH primer, where H=A+T+C; 

d) adapter primer and a (TG)nV primer, where V=G+A+C. 

The products of amplification result from genomic fragments 
that successfully ligate to the chosen adapter and contain a VNTR with 
homology to the chosen primer. 
15 (v) Digestion of the (AC)nB and (CA)nD primed products by T4 

DNA polymerase in the presence of dATP and dCTP, followed by 
Exonuclease VII to remove all VNTR sequences and excess VNTR primer. 

(vi) Digestion of the (GT)nH and (TG)nV primed products by T4 
DNA polymerase in the presence of dGTP and dTTP, followed by 

20 Exonuclease VII to remove all VNTR sequences and excess VNTR primer. 
Size selection may be performed to obtain products of an optimal range of 
molecular weights. 

(vii) Hybridization of an excess of either the combined (AC)nB 
and (CA)nD primed products or the combined (GT)nH and (TG)nV primed 

25 products with a sufficient amount of genomic DNAs derived from individuals 
manifesting a particular trait of interest. 

(viii) Incubation of the hybridized products with Taq DNA 
polymerase to achieve strand extension of all annealed 3' ends. 

(ix) Addition of adapter primer and generation of VNTR alleles 

30 from the 'genomic template 1 by thermal cycling in the presence of Taq DNA 
polymerase. ) 



! 
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(x) Purification of the generated VNTR alleles followed by strand 
dissociation and reannealing under stringent conditions. 

(xi) Digestion with T4 endonuclease VII of mis-match containing 
duplex molecules that result from hybridization of VNTR alleles to spurious 

5 products of amplification, or hybridization of VNTR alleles that differ among 
the individuals under investigation manifesting a particular trait of interest. 

(xii) Further digestion by T7 gene 6 exonuclease together with S1 
nuclease to remove VNTR sequence from cleaved molecules or eliminate 
them completely. 

io (xiii) Amplification of the surviving DNA molecules by thermal 

cycling in the presence of Taq DNA polymerase. 

(xiv) Repetition of hybridization, digestion and amplification of the 
surviving DNA molecules. This enriches the reaction in VNTR alleles that 
are common to all individuals manifesting the particular trait of interest or 

is those alleles that predominate in such a group and removes any spurious 
products of amplification. 

(xv) Addition of a 3' overhang to the selected alleles of the group 
of individuals manifesting a particular trait by incubation with Terminal 
deoxynucleotidyl transferase in the presence of a dNTP. 

20 (xvi) Hybridization of the selected VNTR alleles of the group of 

individuals manifesting a particular trait that possess a 3' overhang to an 
excess of the VNTR alleles of individuals in which the trait is absent that 
have been generated from their genomic DNAs in a method bearing 
similarity, wholly or in part, with (i) to(xiv). 

25 (xvii) Digestion of mis-match containing duplex molecules by T4 

endonuclease VII. 

(xviii) Further digestion by Exonuclease III to eliminate strands in 
duplex molecules that lack protection by a 3* overhang. 

(xix) Further digestion, after removal or inactivation of the 

30 Exonuclease III, by Exonuclease I to remove single stranded DNA. This 
results in elimination of all molecules other than the VNTRs linked to the 
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particular trait. For intact VNTRs the informative allele is present For 
cleaved VNTRs that survive digestion by Exonuclease III and 
Exonuclease I the entire VNTR sequence may be obtained after 
hybridisation to fragmented, terminated, adapter-ligated genomic DNA and 
5 strand extension by Taq DNA polymerase such that VNTR specific primers 
may be designed from the flanking sequences that allow genotyping of 
affected individuals to implicate the informative allele linked to the trait. 

A second embodiment 

10 (i) VNTR alleles are generated by means other than processes 

of amplification of fragmented and ligated genomic DNA with adapter 
primer and VNTR primer, hybridization of the generated products to 
genomic 'template* DNAs of individuals manifesting a particular trait, and 
generation of the respective VNTR alleles from those template DNAs. 

15 These may include but are not limited to: 

a) amplification of VNTRs from genomic or synthetic DNA using 
primers specific to the flanking regions of each VNTR in individual 
reactions; 

b) amplification of VNTRs from genomic or synthetic DNA using a 
20 multiplex system, thereby allowing amplification of multiple VNTRs en 

masse using adapted VNTR specific primers; 

c) amplification of VNTRs from genomic or synthetic DNA using an 
endonuclease that cleaves in or about VNTR sequences such that 
adapters may be ligated to the digested DNA and used for amplification of 

25 the VNTR alleles; 

d) generation of a pool of VNTRs from individuals manifesting a 
particular trait by processes of subtraction with those in which the trait is 
absent. \ 

(ii) Purification of the generated VNTR alleles followed by strand 
30 dissociation and reannealing under stringent conditions. 

(iii) Digestion vvith T4 endonuclease VII of mis-match containing 



urious products 



15 



20 
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several methods of differentiating mis-match containing duplexes from 
those that are free of mis-matches, either in solution or on an array. The 
methods described in the above embodiments represent only one of these 
methods. 

Those that are skilled in the art will appreciate that the 
invention is equally well suited any type of VNTR including but not 
restricted to dinucleotide repeats e.g.(CA)n and (GT)n, trinucleotide repeats 
e.g.(AAT)n, (AGC)n, (AGG)n, (CAC)n, (CCG)n and (CTT)n, and 
tetranucleotide repeats e.g.(CCTA)n, (CTGT)n, (CTTT)n.(TAGG)n, 
(TCTA)n, and (TTCC)n. In addition, the invention may be applied to simple 
organism microsatellites that include, but are not limited to, (AT), (CC), 
(CT) and (GA) rich tracts of repetitive motifs. 

Those that are skilled in the art will appreciate that 
polymorphic alleles, other than those of VNTRs, may be used with the 
invention to produce alleles that are free of spurious products of 
amplification and are common to all individuals manifesting a particular 
trait. These polymorphic alleles may be hybridized to a fixed array of all 
possible alleles, or subset thereof, or to a pool of alleles derived from 
individuals in which that trait is absent. By mis-match discrimination those 
alleles linked and informative for a trait can be identified. 

Those that are skilled in the art will appreciate that alleles 
from the genome of a single individual, or more than one individual, of 
unknown phenotype and genotype may be amplified with fidelity, removing 
the spurious products of amplification by mis-match discrimination, and 
hybridized to a fixed array of alleles, or to a pool of alleles in solution, in 
order assign a genotype or a phenotype to that individual. 

Those that are skilled in the art will appreciate that mis-match 
discrimination may be performed using enzymes or chemicals other T4 
endonuclease VII. These alternatives include but are not limited to S1 
so nuclease. Mung Bean nuclease, mutation detection proteins (e.g. Mut S). 
osmium tetroxide and hydroxylamine. 



25 
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Those that are skilled in the art will appreciate that the 
polymorphic sequences that are amplified are themselves valuable and 
may be used in protocols other than that which determines co-segregation 
of VNTRs with a hereditary trait including but not limited to genotyping, 
5 mapping, positional cloning, quantification of trait loci, studies of ancestry 
and evolution, population studies, phylogenetics, and the study in vitro as 
well as in vivo of VNTRs and the sequences that separate them. 

Those that are skilled in the art will appreciate that the invention 
may be used to identify somatic mutations that are non-hereditary if a 
o VNTR is involved in that mutation. 

Those that are skilled in the art will appreciate that the 
terminated and adapter-ligated genomic fragments may be used to 
recreate or amplify that region of the genome with sequence homology to 
any nucleotide sequence known or unknown to which they are hybridised. 
15 Those that are skilled in the art will appreciate that the 

method represents a means of purifying a consensus sequence from PCR 
products such that the spurious products of amplification are eliminated. 

Those that are skilled in the art will appreciate that the 
method represents a means of purifying a consensus sequence from any 
20 pool of one or more types of DNA molecule. 

The invention differs fundamentally from all previous 
techniques since genomic fragments are generated that do not reflect the 
polymorphic variation at the locus from which they were derived. 
Furthermore, these fragments need not be generated from an individual in 
25 a particular investigation, but may be from any individual of the appropriate 
species. However, hybridization of these fragments to genomic 'template' 
DNA of an individual subject to investigation and mis-match discrimination 
permits amplification, with fidelity, of alleles within that genomic template 
whilst overcoming the problems of generation of spurious products that are 
30 a feature of other PCR : -based methods. If the genomic fragments are 
derived from a single individual the problems of polymorphic variation 
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within the sequences that flank ach VNTR are negated because these will 
be identical for all individuals under investigation. Since the invention 
preserves each VNTR allele with its flanking sequences, these alleles 
remain highly informative. In this respect the invention is unique. 

5 Furthermore, this novel method of generating VNTRs is rapid, inexpensive, 
has no requirement for prior knowledge of sequence, and has no 
requirement for elaborate equipment, it is of immense importance obviating 
the high investment of time and money that is currently required for 
isolation of VNTRs. Consequently, the application of technologies 

10 dependant on the availability of VNTR in species in which none have been 
isolated will be possible where previously this was unfeasible. The ability 
to generate large numbers of VNTRs from all species quickly, efficiently, 
cheaply and with fidelity is a considerable contribution of the present 
invention to workers in the to the biomedical field. 

15 In summary, the invention involves a novel method of 

generating VNTRs encompassing restriction endonuclease digestion of 
DNA, ligation of the fragments to adapters and, by introduction of a primer 
with sequence homology to a chosen VNTR, amplifying only those 
fragments that are flanked by a chosen endonuclease restriction enzyme 

20 site and a VNTR. These fragments are not representative of the alleles of 
each VNTR and need not be generated from any specific individual under 
investigation. Hybridization of these fragments with genomic DNA of the 
individuals under investigation recreates the intact VNTR alleles with 
flanking sequence, as they occur in the genome. This in itself constitutes a 

25 major step in the ability of workers in the biomedical fields to generate 
quickly, efficiently, cheaply and with fidelity VNTRs in all species for 
purposes reliant on the availability of VNTRs, including but not confined to 
DNA fingerprinting and linkage analysis. The incorporation of a mis-match 
discrimination procedure overcomes the problems of miss-priming and 

so generation of spurious products by reaction contamination and subtle 
variation in reaction conditions, that are to the detriment of all PCR-based 
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technologies, and allows exclusion of alleles that are not common to all 
individuals under investigation that manifest a particular trait. A second 
round of mis-match discrimination removes un-informative alleles that are 
present in the genomes of individuals that do not manifest the trait. This 

5 procedure is designated a lotal Representation of Alleles that are 
Informative for a Trait (TRAIT). The invention, therefore, has significant 
advantages over previous methods, embracing the speed of analysis of 
AFLP, GMS, RDA and RAPD, and the high polymorphism detection rate of 
linkage analysis, but negating the need for DNA from closely related 

10 individuals and for paternity testing. The invention also overcomes 
fundamental problems that are a feature of PCR based technologies, 
including miss-priming and generation of spurious products through 
reaction contamination and subtle variations in the conditions of reaction. 
Furthermore, there is no requirement for expensive equipment or elaborate 

15 statistical computer software. The analysis will give rise to alleles that are 
both linked and informative, being present exclusively or at a higher 
frequency in individuals manifesting the trait of interest but absent or 
present at a lower frequency in those individuals that lack the trait. In this 
respect, the invention is unchallenged in its superiority over all other 

20 methods. 

The invention allows concomitant detection of polymorphisms 
at multiple loci by simultaneous comparison of simple or complex genomes 
from multiple individuals and differs fundamentally from all other techniques 
that have been previously employed. The invention represents a major 

25 advance in the ability of workers in the biomedical fields to generate 

VNTRs from the genomes of any species quickly, efficiently, cheaply and 
with fidelity in addition to screening complex genomes for polymorphisms 
co-segregating with hereditary traits. Application of this procedure will 
therefore facilitate the development of markers for genetic screening for 

30 hereditary disease, or advantageous monogenic or polygenic traits in all 
organisms. 



WO 98/42867 



PCT/GB98/00840 



-50- 

Exampl s fHowth Invention may be Appli d 

The following illustrations represent examples of how the 
invention may be applied without inferring any limitation to scope of the 
invention or any limitation to the different ways in which the invention may 
5 be applied. 

Experimental Data 

Example 1 

Preparation of amplimers using (CA)13 and (GU)13 primers. 

io 2ug DNA was completely digested with 3ul Rsa I in a total 

volume of 100ul: 

8.5ul genomic DNA (equivalent to 3ug DNA) 

1 0ul 1 0x reaction buffer 

3ul Rsa I (10u/ul; Promega) 
15 Za^dHjO 

100ul 

The reaction was incubated at 37°C over night followed by 
heat inactivated by incubation at 70°C for 20 minutes. The DNA was 
separated from the buffer by microconcentration (Microcon-100; Amicon). 
20 A volume of 1 0ul was recovered. 

2nmoles of 48mer and 2nmoles of 12mer oligonucleotides 
that constitute the adaptor were combined: 

1 5.9ul 48mer (equivalent to 2 nmoles) 

13.7ul 12mer (equivalent to 2 nmoles) 
25 1 0ul 1 0x ligase buffer (NEB) 

48.4ul dH-Q 

88ul 

The mixture was heated to 50°C and allowed to cool to 10°C 

over 1 hour. 

30 To the 88yl of annealed adaptor was added the 1 0ul of 

digested DNA and ligation of the adaptor to the genomic fragments was 
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10 



performed: 

88|j| annealed adaptor/ ligase buffer (containing ATP) 
10ul DNA 

2yLT4 DNA ligase (400 NEBu/ul) 
100pl 

The reaction was incubated at 16°C over night and then heat 
inactivated by incubation at 70°C for 20 minutes. 

The adaptor-ligated DNA fragments were separated from the 
buffer and non-ligated adaptor by microconcentration (Microcon-100; 
Amicon). A volume of 12ul DNA was recovered. 

The adaptor-ligated DNA fragments were incubated with Taq 
DNA polymerase in the presence of dideoxynucleotide triphosphates to 
prevent 3' extension of the adaptor and non-ligated DNA in subsequent 
manipulations: 

1 5 1 2ul microconcentrated DNA 

3ul 10x NH4 reaction buffer 
1ul50mMMgCl2 

1ul 10mMddATP 
1ul10mMddCTP 
20 1ul 10mMddGTP 

1 M> 10mM ddTTP 

1 pi Taq DNA polymerase (5u/ul; Bioline) 
2uldH 2 0 

30pl 

25 The reaction was incubated at 72°C for 2 hours. 

The adaptor-ligated DNA with terminated 3' ends was purified 
by phenol/chloroform extraction and microconcentration. The volume 
recovered was made up to 40ul and the concentration of DNA was gauged 
by gel electrophoresis. A concentration of 75ng/ul was determined. 

30 ( CA ) Primed amplimers and (GU) primed amplimers were 

generated in separate reactions: 
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10|ji 10x NH4 reaction buffer 
8|jl 50mM MgCl2 
1.5pl tOmMdNTPs 

1 pi adaptor-ligated DNA with terminated 3* ends 
5 4|jl (CA) or (GU) primer (25pmol/pl) 

73.5 M I dH = Q 
98pl 

The reaction was overlaid with mineral oil and heated to 95°C 
for 2 minutes, during which time 1pl Tag DNA polymerase (5u/pl; Bioline) 
10 and 2pl adaptor primer (50pmol/pl) were added. 

Thermal cycling was performed as follows: 95°C for 30 
seconds, then 72°C for 45 seconds for a total of 20 cycles, followed by 
72°C for 5 minutes. 

To the 100pl of (CA) primed products was added 5pl 
15 Exonuclease I (10u/pl) to remove the remaining (CA) primer. This reaction 
was incubated at 37°C for 30 minutes. 

To the 100pl of (GU) primed products was added 10pl Uracil- 
DNA glycosylase (1u/pl; NEB) to digest all uracil incorporated into the PCR 
products. This reaction was incubated at 37°C for 2 hours. 1pl 10mM 
20 dNTPs was added followed by 2pl T4 DNA polymerase (5u/pl; Epicentre 
laboratories) to remove the protruding (CA) strand that complemented the 
digested (GU) sequence. This reaction was incubated at 37°C for 5 
minutes. Both the pools of amplimers were phenol/chloroform extracted 
and microconcentrated (Microcon-100; Amicon). For each pool, the 
25 volume recovered were made up to 500pl, of which 5pl was analysed by 
spectrophotometry to determine the concentration of DNA. 

Equal amounts of (CA) and (GU) primed amplimers were 
hybridized to genomic template' DNA of a single individual prior to thermal 
cycling. In order to gauge the optimal ratio of amplimer to genomic 
30 'template' DNA several reactions were performed using various amounts of 
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'template' DNA while keeping the amount of amplimers constant: 

Template' DNA (ng) 0 0.1 1 10 100 1000 

Combined amplimers (ng) 111111 
5 5M NaCI (|j|) 0.22 0.22 0.22 0.22 0.22 0.22 

dH 2 0 (ul) To a final volume of 5.55ul 

Each reaction was overlaid with mineral oil and incubated at 
98°C for 5 minutes, after which the temperature was reduced stepwise to 
78°C over 4 hours. 
io The following was added to each hybridization: 

5ul 10X NH4 reaction buffer 
4ul 50mM MgCI 2 
0.75ul10mM dNTPs 
0.5ul adaptor primer (50pmol/ul) 
15 34.2ul dH z O 

Each reaction was spun briefly in a microfuge. They were 
heated to 72°C for 2 minutes and 0.5ul Taq DNA polymerase (5u/ul;Bioline) 
was added. The reactions were incubated at 72°C for a further 10 minutes, 
after which the temperature was raised to 95°C for 2 minutes. Thermal 

20 cycling was performed as follows: 95°C for 30 seconds, then 72°C for 1 
minute, for a total of 10 cycles. 

For each reaction 10ul of products amplified for 10 cycles 
were added to 40ul of reaction mix and amplified under the same 
conditions for an additional 22 cycles. 5ul of the ends products of 

25 amplification were run on an agarose gel. The reaction containing 100ng 
genomic 'template' DNA was found to yield the most products of 
amplification, equivalent to a ratio of 100:1 by mass of genomic template' 
DNA: amplimer. 

The invention was validated by cloning the products of 

30 amplification. Two colonies of Ecoli that had successfully transformed 
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were cultured, from which plasmids were later harvested. These plasmids 
were sequenced and were found to contain VNTR sequences at the 
multiple cloning sites. 



5 Further experimental data 

For the following experiments canine genomic DNA or cloned 
VNTR alleles amplified from canine genomic DNA were used. The cloned 
alleles were ligated into the Sma1 site of the pUC18 MCS, either side of 
which plasmid specific primers were designed for subsequent amplification 
10 of the plasmid inserts: 



Plasmid specific sense primer 

S- ATGCCTGC AGGTCGACTCTAGAGQA 



Plasmid specific antisense primer 
GGCTCGAGCTT AAGGGATATC ACTC -5 J 



GCCAAGCTTCC ATGCCTGC AGGTCGACTCTAGAGGATCCCCGGGTACCGAGCTCGAATTCCCTATAGTGAG 



Hind III 



Sphl 



Pstl 



Sail 
Acc I 
Hinc II 



Xbal 



BamHI 



Kpnl 



Sac I 



EcoRI 



Sma I 
Xmal 



All reagents were obtained from Amersham Pharmacia 
15 Biotech, or its subsidiary companies, unless stated otherwise. 

Oligonucleotides were obtained from Genset Corp., France. 
The VNTR primers (AC)1 1 B, (CA)1 1 D, (GT)1 1 H and (TG)1 1 V comprised 
eleven repetitions of the sequence shown in brackets followed by a 
degenerate base were B = C + G + T, D = A + G+T, H = A + C + T, and V 
20 = A + C + G. 



Example 2 

Generation of adapter-ligated, dideoxynucleotide terminated genomic 
fragments with (a) termination preceding adapter ligation and (b) 
25 adapter ligation preceding termination. 

(a) 5jig canine genomic DNA were fragmented with Hae III, the 

digestion proceeding to completion over 12 hours at 37°C: 
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4.4jil 

10ul 

2|il 

mi 

100>l 



1.135ng/|il genomic DNA 



10x restriction buffer 



10u/}i! Hae HI 
dH 2 0 



5 



Digestion was confirmed by electrophoresis of an aliquot of 
the reaction on a 1% agarose gel stained with ethidium bromide. 

The DNA was extracted (GFX purification column) and eluted 
in 50ul 5mM Tris pH8.5, of which 30ul was incubated with Terminal 
10 deoxynucleotidyl transferase for 3 hours at 37°C : 



by microconcentration (Microcon-30; Amicon) with successive additions of 
dhfeO between episodes of centrifugation. A volume of 35)il was 
20 recovered. 

An adapter was prepared by annealing two oligonucleotides, 
a 24mer (GsCsAsGsGAGACATCGAAGGTATGAAC, where 's' represents 
a phosphorothioate bond) and a 12mer (TTCATACCTTCG): 



15 



30iil DNA 

30>l 5x Terminal deoxynucleotidyl transferase buffer 
4.5^1 10mMddGTP 

1 0nl 9u/fil Terminal deoxynucleotidyl transferase 

ZL5uldH 2 0 

150^ 

The DNA was separated from low molecular weight solutes 



25 



7.6^1 197pmol/nl24mer 
9.2nM62pmol/nl12mer 

1 0x T4 DNA ligase buffer 

18.7ul 

The mixture was heated to 55°C and allowed to cool to 10°C 



over one hour. 



30 



The adapter was ligated to the terminated genomic 
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fragments: 

35^1 DNA 
18.7uJ adapter 

4.3nl 10x T4 DNA ligase buffer 
5 1.5jJ 10u/ul T4 DNA ligase 

l^nldHaO 
62^1 

The reaction was incubated at 16°C over night, then heat 
inactivated at 70° C for 20 minutes. 
io The DNA was separated from low molecular weight solutes 

by microconcentration (Microcon-30; Amicon) with successive additions of 
dH 2 0 between episodes of centrifugation. A volume of 54jil was 
recovered. 

To prevent generation of spurious products through priming 
is from sites of single strand nicks, these were terminated by incubation with 
Thermo Sequenase: 

54^1 DNA 

4.4ul Thermo Sequenase buffer 

1.4ul 10mMddATP 
20 1.4nM0mMddCTP 

1.4jil10mM ddGTP 

1.4nl10mM ddTTP 

0.5nl 32u/fxl Thermo Sequenase 

L5.nl dH 2 0 
25 70nl 

The mixture was overlaid with mineral oil and incubated at 
74°C for 2 hours. 

The DNA was extracted (GFX purification column) and eluted 
in 50nl 5mM Tris pH 8.5. 
30 (b) 5jig canine genomic DNA were fragmented with Mbo I, the 
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digestion proceeding to completion at 37°C: 



5 



4.4^1 1.135ng/jil genomic DNA 
10uJ 10x restriction buffer 
2.5^1 10u/nl Mbo I 
S2fildH 2 0 
100ul 

Digestion was confirmed by electrophoresis of an aliquot of 



the reaction on a 1% agarose gel stained with ethidium bromide. 

Following incubation at 70°C for 20 minutes the DNA was 
10 separated from low molecular weight solutes by microconcentration 
(Microcon-30; Amicon) with successive additions of dH 2 0 between 
episodes of centrifugation. A volume of 32^1 was recovered of which half 
was ligated to an adapter: 

An adapter was prepared by annealing two oligonucleotides, 
15 a 24mer (GsCsAsGsGAGACATCGAAGGTATGAAC, where 's' represents 
a phosphorothioate bond) and a 16mer (GATCGTTCATACCTTC): 



The mixture was heated to 55°C and allowed to cool to 10°C 
over one hour. 



20 



6.3jil1 97pmol/nl 24mer 
8.5ul 147pmol/nl 16mer 
l£5.n 10x T4 DNA ligase buffer 
16.5nl 



The adapter was ligated to the genomic fragments: 
16ii\ DNA 



25 



30 



16.5|il adapter 

2.4nl 10x J4 DNA ligase buffer 
2|xl 10ul/|il|T4 DNA ligase 

40^l : 

The reaction was incubated at 16°C over night, then heat 
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inactivated at 70°C for 20 minutes. 

The DNA was separated from low molecular weight solutes 
by microconcentration (Microcon-30; Amicon) with successive additions of 
dH 2 0 between episodes of centrifugation. A volume of 40nl was 
5 recovered. 

The adapter-ligated fragments were terminated using Thermo 
Sequenase: . 

40nl DNA " T 

4.4(il Thermo Sequenase buffer 
io 1.4|al 10mMddGTP 

0.5|il 32u/ul Thermo Sequenase 

24^1 dH 2 0 

70^1 

The reaction was overlaid with mineral oil and incubated at 
15 74°C for 1 hour. To prevent generation of spurious products through 
priming from sites of single strand nicks, these were terminated by further 
incubation with Thermo Sequenase and addition of the remaining ddNTPs: 
14jil 10mM ddATP 
1.4jxl10mMddCTP 
20 1.4^1 10mM ddTTP 

(LSnIThermo Sequenase buffer 
4.8nl 

The reaction was incubated at 74°C for a further hour. 

The DNA was extracted (GFX purification column) and eluted 
25 in 50jaI 5mM Tris pH 8.5. 

Methods (a) and (b) of adapter ligation and termination of the 
genomic fragments were compared by amplification of the resulting 
fragments with or without an 'internal 1 primer in reactions comprising the 
following: 



30 
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5(al 


5^1 


5|al 


10xTaq PCR buffer 




5|il 




10x dNTPs 


V 


VI 


VI 


25pmol/|il 24mer 


VI 


VI 




50pmol/|il(AC)11B 


50ng 


Ong 


50ng 


GFX extracted DNA 




to 50^1 


dH 2 0 



Each reaction was overlaid with mineral oil and heated to 
95°C for 2 minutes. 

0.5nl of 5u/\il Taq DNA polymerase was added to each 

10 reaction, which was amplified for 25 repetitions of 95°C for 30 seconds, 
65°C for 30 seconds, 72°C for 1 minute, followed by a final extension of 
72°C for 5 minutes. 

7.5^1 of each reaction was subjected to electrophoresis on a 
1.5% agarose gel stained with ethidium bromide. The negative control 

is reactions that lacked DNA generated no product, while those reactions 
containing all components generated a smear of products of various 
molecular weights. In contrast, the reactions containing DNA but no 
internal primer were incapable of generating product. These results 
confirmed that adapters had been ligated successfully to genomic 

20 fragments and all 3' ends capable of extension in the presence of a DNA 
polymerase had been terminated. The preferred method was termination 
prior to ligation since (i) this guaranteed that all fragments successfully 
ligating were terminated and (ii) the opportunities for inter-fragment ligation 
were remote. 

25 

Amplification of 5* and 3" flanking sequences from terminated, 
adapter-ligated genomic fragments. 

Amplification reactions were performed for each VNTR primer 
containing the following: 

. 30 

i 
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5\x\ 5\x\ 1 0x Taq PCR buffer 

Sfxl 5^x1 10xdNTPs 

2\i\ 2\x\ 25pmol/^I 24mer 

2fxl 2\x\ 25pmol/^l (AC)1 1 B or (CA)1 1 D or (GT)1 1 H or 



(TG)11V 



2nl OjjlI fragmented, terminated, adapter-ligated 
genome (approx. SOng/jxl) 

34|ai m\xl dH 2 0 
50nl 50^il 

io In addition, a parallel reaction was prepared containing all 

components except a VNTR primer. 

All reactions were overlaid with mineral oil and heated to 
95°C for 2 minutes. O.Sjil of 5u/^l Taq DNA polymerase was added to each 
tube and amplification was achieved by thermal cycling for 18 repetitions of 

is 95°C for 30 seconds, 65°C for 45 seconds, 72°C for 45 seconds, followed 
by a final extension of 5 minutes at 72°C. 

5^1 of each reaction was loaded onto a 1.5% agarose gel 
stained with ethidium bromide, along with a molecular weight marker. The 
reactions that contained all components generated a smear of products of 

20 ranging from approximately 100 to 500bp, the intensity and distribution of 
molecular weights being comparable for each reaction. The lanes 
corresponding to those reactions lacking DNA and the reaction lacking a 
VNTR primer did not contain any product of amplification. 

25 Example 3 

The efficiency of digestion of the repeat sequence from a VNTR 
primed PCR product by T4 DNA polymerase was assessed. 

A cloned VNTR allele was amplified by Taq DNA polymerase 
and separated from low molecular weight solutes by microconcentration 
30 (Microcon-30; Amicon) with successive additions of dH 2 0 between 
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episodes of centrifugation. A volume of 40nl was recovered, the 
concentration of which was judged by agarose gel electrophoresis to be 
130ng/nl, approximating to 1.3pmol/jxl. 

A 1 .5u/|il dilution of T4 DNA polymerase was prepared with 
5 dH 2 0. The amplified DNA was digested at a concentration of 0.3pmol/|il 
with varying concentrations of T4 DNA polymerase at 12°C: 

1 .5nl 1 0x T4 DNA polymerase buffer 

0.75nl 10mMdATP 

0.75nl 10mMdCTP 
10 3.5jil DNA 

0, 0.5, 1 , 2, or 4jxl 1 .5u/fxl T4 DNA polymerase 

to 15nl dH 2 0 

Parallel reactions were prepared that lacked dNTPs. 
The reactions were incubated at 12°C for 1 hour, followed by heat 
15 inactivation at 70°C for 20 minutes. 

7.5u,l of each reaction were subjected to electrophoresis on a 
2.5% agarose gel stained with ethidium bromide. In the absence of dNTPs 
all DNA was digested with enzyme concentrations exceeding 0.05u/nl. By 
contrast, there was no discernible loss of DNA in the presence of dNTPs at 
20 any concentration of T4 DNA polymerase. 

The efficiency of digestion of the repeat sequence from a VNTR 
primed PCR product by T7 gene 6 exonuclease was assessed. 

A cloned VNTR allele was amplified with the plasmid specific 
25 sense primer and the (GT)1 1 H primer by Taq DNA polymerase in the 
presence of [a-33P] dATP. Parallel reactions were performed for primers 
that contained or lacked a succession of four phosphorothioate bonds. In 
the primer pair containing phosphorothioate bonds these where located at 
the 5' end of the plasmid specific primer and at the 3' end of the (GT)1 1H 
30 primer. 
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The amplified DNA was separated from low molecular weight 
solutes by microconcentration (Microcon-30; Amicon) with successive 
additions of dH 2 0 between episodes of centrifugation. Equal amounts of 
the amplification reactions were digested by T7 gene 6 exonuclease at 
5 37°C for 15 and 30 minutes, the concentration of DNA approximating to 
0.1pmol/ul: 

3.6jxl DNA 

2(il 5x T7 gene 6 exonuclease buffer 
1 \x\ 1 0u/|il T7 gene 6 exonuclease 
10 SAvlI dH 2 0 

10jil 

A control reaction was incubated for 15 minutes at 37°C in the 
absence of enzyme. 

All reactions were denatured at 95°C for 2 minutes with 
is addition of 5^1 formamide loading dye. 10jjtl of each sample was subjected 
to electrophoresis on an 8% polyacrylamide denaturing gel. An 
autoradiography film (Biomax MR; Kodak) was exposed to the gel after it 
had been fixed and dried. 

It was found that after 15 minutes of incubation the DNA that 
20 lacked phosphorothioate protection had been digested completely. By 
contrast, the presence of phosphorothioate bonds preserved the DNA, one 
strand in each molecule becoming shortened by digestion of the enzyme, 
although some non-specific loss of DNA was seen. 



25 The efficiency and specificity of digestion by T4 endonuclease VII and 
S1 nuclease was compared. 

Cloned VNTR alleles of the same VNTR that differed in their 
repeat lengths by 4 nucleotides were amplified separately in the presence 
of [a-33P] dATP. The products derived from the shorter allele were divided 
30 equally between two tubes. To one tube an equal amount of the longer 
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allele was added and the mixture was hybridised by denaturing at 98°C for 
2 minutes and annealing at 75°C for 150 minutes in 100mM NaCI and 
200>iM CTAB. 

The hybridised and non-hybridised pools of DNA were 

5 separated from other low molecular weight solutes by microconcentration 
(Microcon-30; Amicon) with successive additions of dH 2 0 between 
episodes of centrifugation . 

T4 endonuclease VII was diluted to 250u/^l in the supplied 
dilution buffer. Dilutions of S1 nuclease were prepared in dH 2 0. Equal 

io amounts of either hybridised DNA or non-hybridised DNA were digested by 
5Gu/nl T4 endonuclease VII in Taq DNA polymerase buffer or by various 
concentrations of S1 nuclease in the supplied buffer. The S1 nuclease was 
added to the reactions to give final concentrations of 0.01 u/nl, 0.03u/fil, 
0.1u/|il, and 0.3u/^tl. In each case a control reaction that lacked enzyme 

is was prepared. The reactions were performed at 37°C for 30 minutes. 

On completion of digestion the reactions were stopped by 
addition of EDTA and heat inactivation. An amount of formamide loading 
dye equal to half the reaction volume was added and each reaction was 
denatured by incubation at 95°C for 5 minutes. 12 of each sample were 

20 subjected to electrophoresis on an 8% polyacrylamide denaturing gel. An 
autoradiography film (Biomax MR; Kodak) was exposed to the fixed and 
dried gel. 

T4 endonuclease VII was found to cleave about half of all 
DNA derived from hybridisation of approximately equal amounts of two 

25 different alleles of the same VNTR, creating a characteristic pattern of 
cleaved products corresponding to the position of the mis-match within the 
repeat sequence at the time of cleavage, The DNA derived from the single 
allele that had not been hybridised and, therefore, comprised mis-match 
free double stranded DNA; was not affected by T4 endonuclease VII. In 

30 contrast, the characteristic pattern of cleaved products that was seen with 

i 

T4 endonuclease VII was pot seen in association with S1 nuclease under 

i 
t 
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any of the reaction conditions. As such, T4 endonuclease VII was 
considered the better of the two enzymes in this application. 

Repetition of the T4 endonuclease VII reactions using various 
concentrations of enzyme for 30 minutes and 1 hour of digestion in 1 x Taq 

5 PCR buffer, 1x Pfu buffer (Stratagene) and 1x T7 gene 6 exonuclease 
buffer confirmed that the enzyme digested predictably and reproducibly 
over a range of reaction conditions, their being no overt non specific 
digestion of DNA detectable at concentrations up to 200u/^l. The enzyme 
was found to cleave hybridised molecules containing mis-matches of a 

10 range of sizes. 

The characteristic pattern of cleaved products resulting from a 
mis-match within a repeat sequence was seen with S1 nuclease only when 
large amounts of DNA were loaded onto a polyacrylamide gel. This was 
seen with a four nucleotide mis-match. The ability of S1 nuclease to 

is resolve a two nucleotide mis-match was found to be poor. 

The effect of enzyme concentration on the efficiency of cleavage of 
mis-match containing duplex DNA by T4 endonuclease VII was 
assessed. 

20 Two cloned VNTR alleles that differed in allele length by 2 

nucleotides were amplified separately using the plasmid specific primers, 
one of which had been labelled with [y-33P] ATP using T4 polynucleotide 
kinase. Each amplified allele was separated from low molecular weight 
solutes by microconcentration (Microcon-30; Amicon) with successive 

25 additions of dh^O between episodes of centrifugation. 

Half of the DNA derived from amplification of the smaller 
allele was saved. To the remaining half was added approximately an equal 
amount of amplified DNA of the larger allele. This mixture was denatured 
at 98°C for 2 minutes and then annealed at 75°C for 2 hours in the 

30 presence of 100mM NaCI and 200^M CTAB, the transition between 

temperatures occurring rapidly. Separation of the annealed DNA from low 
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molecular weight solutes by microconcentration was repeated. 

Serial dilutions of T4 endonuclease VII were prepared in the 
supplied dilution buffer. The non-denatured smaller allele and the allele 
mixture that had been denatured and annealed were each digested in Taq 
5 DNA polymerase buffer with T4 endonuclease VII at final concentrations of 
Ou/nl, SOu/jil, 1 0Ou/jil and 150u/^l: 

6jxl DNA 

1^1 1 0x Taq PCR buffer 
3^1 T4 endonuclease VII 
10 10nl 

Incubation at 37°C was carried out for 30 minutes, after which 
each reaction was heated to 95°C for 2 minutes with addition of 5^1 
formamide loading dye. 10^1 volumes were subjected to electrophoresis 
on an 8% polyacrylamide denaturing gel, after which the gel was fixed, 
15 dried and exposed to an autoradiography film (Biomax MR; Kodak). 

Almost no digestion of the non-denatured smaller allele was 
detected. The little that was seen was assumed to have occurred as a 
result of digestion at sites of polymerase error or the annealing of stutter 
bands during the final cycle of amplification. In the lanes corresponding to 
20 the annealed allele mixture the characteristic pattern of digestion was seen 
to occur in the presence of T4 endonuclease VII. Although the amount of 
digestion at 1 00u/(al appeared to be slightly greater than at 50u/nl, the 
degree of digestion at each enzyme concentration was found to be almost 
uniform. 

25 Similar experiments were performed using various 

concentrations of T4 endonuclease VII in Pfu buffer (Stratagene) and T7 
gene 6 exonuclease buffer. Efficient digestion of mis-match containing 
DNA was found to occur in both reaction buffers, the degree of digestion 
maximising at concentrations of T4 endonuclease VII between 50u/(jiI and 

30 100u/nl. Duplex DNA lacking a mis-match was resistant to T4 
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endonuclease VII under these conditions. 

The efficiency and specificity of S1 nuclease digestion in 17 gene 6 
exonuciease buffer was assessed. 

5 A cloned VNTR allele was amplified with the plasmid specific 

primers, one of which had been labelled with [y-33P] ATP using T4 
polynucleotide kinase. The amplified product was separated from low 
molecular weight solutes by microconcentration (Microcon-30; Amicon) with 
successive additions of dH 2 0 between episodes of centrifugation. The 

10 volume of recovered DNA was divided: 30^1 was preserved as double 

stranded DNA while the remaining 30^1 DNA was rendered single stranded 
by denaturation at 98°C for 2 minutes followed by snap cooling on iced 
water. 

Dilutions of S1 nuclease were prepared in dH 2 0. Equal 
is amounts of double stranded DNA or single stranded DNA were digested in 
T7 gene 6 exonuciease buffer at 37°C for 5 minutes in the presence of S1 
nuclease at final concentrations of Ou/^il, 0.1u/nl, 0.3u/|il, lu/^I and 3u/|il. 
On completion of digestion the reactions were stopped by addition of 
5Q0mM EDTA pH8 to a final concentration of 25mM. 
20 The reactions were denatured by addition of formamide 

loading dye and heating to 95°C for 3 minutes, after which aliquots were 
subjected to electrophoresis on an 8% polyacrylamide denaturing gel. The 
gel was fixed, dried, and exposed to an autoradiography film (Biomax MR; 
Kodak). 

25 It was found that a concentration of 1 u/pJ S1 nuclease in T7 

gene 6 exonuciease buffer produced optimal digestion of single stranded 
DNA, there being no overt loss of double stranded DNA at this 
concentration. 
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Assessment of the digestion of DNA by T7 gene 6 exonuclease in 
concert with S1 nuclease. 

For assessment of T7 gene 6 exonuclease and S1 nuclease, 
DNA was amplified from a cloned VNTR allele using the plasmid specific 
5 sense primer with four phosphorothioate bonds at the 5' end and either the 
(AC)1 1 B primer containing four phosphorothioate bonds at the 3' end or the 
(AC)11B primer that lacked such bonds. The amplified products were 
separated from low molecular weight solutes by microconcentration 
(Microcon-30; Amicon) with successive additions of dH 2 0 between 
10 episodes of centrifugation. The volumes recovered in each case were 
measured to be 40^1. These were found to contain approximately 
1.3pmoi/nl and 0.35pmol/fil for the reactions primed by the VNTR primer 
with and without phosphorothioate bonds, respectively. 

T7 gene 6 exonuclease was diluted to 10u/^l in dH 2 0. 
15 S1 nuclease was diluted to 10u/^il in dH 2 0. 

Each amplified product, at a concentration of approximately 
0.1pmol/nl t was digested by T7 gene 6 exonuclease. In addition, the DNA 
generated with the (AC)11B primer containing phosphorothioate bonds was 
digested by T7 gene 6 exonuclease in concert with S1 nuclease: 

20 

without PT boncfo 

4*1) 4>il 
5.7nl 1.6^1 
0. 2, 4, 8fi! 
25 0p\ Ojil 

to 20nl to 20jal 

Each reaction was incubated at 37°C for 10 minutes, after 
which 500mM EDTA pH8 was added to each tube followed by 
incubation at 70°C for 20 minutes. 
30 10^1 of each digest was subjected to electrophoresis on a 

2.5% agarose gel stained with ethidium bromide. Lanes corresponding to 
reactions lacking enzyme contained a discrete band of the expected 



with PT bono 1 ? with PT bonds 

4^1 5x T7 gene 6 buffer 

1.6nl DNA 

0, 2, 4, 0, 2, 4, 8*il 10u/|il T7 gene 6 exonuclease 

2\i\ 10u/}itS1 nuclease 

to 20nl dH 2 0 
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molecular weight. The appearance of a lower molecular weight band, 
corresponding to single stranded DNA, was seen at a concentration of 
1 u/fil T7 gene 6 exonuclease for DNA primed by the (AC)1 1B primer that 
lacked phosphorothioate protection. At concentrations exceeding this 
5 virtually all DNA was single stranded. In contrast, DNA protected by 

phosphorothioate bonds at each end did not appear to alter significantly in 
molecular weight at any of the concentrations of 17 gene 6 exonuclease, 
but a decrease in the amount of DNA was evident with increasing 
concentrations. Similarly, DNA protected at each end was resistant to 
10 digestion of T7 gene 6 exonuclease in combination with S1 nuclease. 
Concentrations of 1u/jil T7 gene 6 exonuclease with 1u/^l S1 nuclease in 
T7 gene 6 exonuclease buffer containing approximately 0.1 pmol/nl DNA 
appeared to give the best results. 

15 The mis-match discrimination procedure was assessed using a model 
system comprising three alleles of the same VNTR in concert with a 
single allele of a second VNTR. 

A mixture of VNTR alleles was prepared that contained three 
alleles of the same VNTR, (AC)10, (AC)11, and (AC)18, in a 2 : 1 : 1 ratio 

20 respectively. In addition, an amount of the (CA)16 allele of a second 
VNTR, equal to that of the (AC)1 1 and (AC)1 8 alleles, was added to the 
mixture. Using Pfu DNA polymerase (Stratagene) 1ng of the mixture was 
amplified by PCR in a reaction volume of 100fil containing 60 pmoles of 
each plasmid specific primer, the sense primer having been labelled with 

25 [y-33P] ATP. Thermal cycling was performed for 17 repetitions of 95°C for 
30s, 65°C for 30s, 72°C fof 45s, followed by a final extension of 72°C for 5 
minutes. j 

The amplified DNA was separated from low molecular weight 
solutes by microconcentration (Microcon-30; Amicon) with addition of dH 2 0 

i 

30 between episodes of centrifugation. The recovered DNA was denatured at 
98°C for 2 minutes and the'n annealed at75°C for 2 hours in 100mM NaCI 

t 
i 

i 
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15 



20 



25 



30 



and 200 MM CTAB, the transKion between ternperatures telng ^ 
The hybridised DNA was separated tam low mo|ecu|ar 

Th O H t S " V miCroC0 ~°" < M ~3«; Amicon, with addttion 
of dH 2 0 between episodes of centrifugation, and digested by T4 
. endonuclease VII in Taq DNA polymerase buffer containing DOufrl <*■» 
enzyme inatota, volume of36ul. Digestion proceeded at 37°C for 1 hour 
after which the reaction was incubated at 75°C for 15 minutes. 

The digested DNA was separated from low molecular weight 
so utes by microconcen.ra.ion (Microcon-30; Amicon, with addition of dH 2 0 
between episodes of centrifugation. Further digestion was performed in a 
SOU reaction containing 1u¥ tt flene 6 exonuclease and 1u,ul si 
nuclease in T7 gene 6 exonuclease buffer a. 37-C for 10 minutes The 
reaction was stopped by addition of 2ul 500mM EDTA pH8 and heating to 
75°C for 10 minutes. 

Microconcentration was performed (Microcon-30; Amicon) 
w,.h addition of dH 2 0 between episodes of centrifugal, A volume of 48», 
was recovered of which 4ul was amplified by PCR, as before. This was 
followed by a second round of the mis-match discrimination procedure 

Aliquots of the amplified DNA before and after each round of 
*. rms-match discrimination procedure were subjected to electrophoresis 
on an 8 A polya^amide denaturing gel. In addWon, for comparison of me 
molecular weight of each product, the PCR products of each allele 
amplified in isolation were loaded onto the gel. 

It was found that Pfu generated numerous stutter bands in 
each amp,if,cation reaction. The amount of the (ACJ10 allele in the mixture 
pnorto mis-match discrimination was approximately twice ttat of all other 
alleles. These others were present in approximately equal amounts. After 
the first round of mis-match discrimination obvious enrichment of the 
(AC,10allelewasseen. This was enhanced by the second round of mi^ 
match discrimination giving rise ,o a ve^ strong band corresponding ,o ,he 
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(AC)10 allele and marked reduction of the (AC)11 and (AC)18 alleles. 
Although a band corresponding to the (CA)15 allele of the second VNTR 
was present after the second round of mis-match discrimination it was not 
as bright as that of the enriched (AC)10 allele. This was considered to 
5 reflect the inequality in the total DNA of each VNTR within the mixture and 
the consequential relative inefficiency of hybridisation following second 
order kinetics. This experiment confirmed that mis-match discrimination 
enriches the allele in a mixture of alleles of the same VNTR that has the 
highest frequency. 

10 

E x a mple 4 

The protocol was assessed using the pooled genomes of several 
dogs. 

In the absence of DNA samples from individuals affected and 
is unaffected by a hereditary trait the protocol was validated on a model 
system designed to mimic a scenario of VNTR linkage disequilibrium that 
would be expected in the presence of a recessive trait. 

A total of 43 dogs were genotyped with respect a VNTR 
previously isolated in the dog using VNTR specific primers. The VNTR 
20 primer pair comprised (CACTTGGGACTTTGGATTGGTCA) sense primer 
and (GTCTTTGTTTCCATTCTTGCTTGC) antisense primer. 

Amplification reactions by PGR were performed in a volume 
of 10fil containing 20ng genomic DNA and 4pmoles of each VNTR specific 
primer. In each case the VNTR specific sense primer was labelled and 
25 added to an amplification reaction master mix: 

1 .5(jiI 1 0x T4 polynucleotide kinase buffer 
2.4^1 SOpmpl/^il VNTR specific sense primer 
4.5ni [y-33]ATP 

1 in 3|dilution of 30 u/^l T4 polynucleotide kinase 
30 5.6nl dH 2 0! 

15^ 
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The reaction was incubated at 37°C for 1 hour, then 90°C for 

5 minutes. 

The T4 polynucleotide kinase reaction was added to a PCR 

master mix: 

5 15|il T4 polynucleotide kinase reaction 

45|al 10x Taq DNA polymerase buffer 
45^1 10xdNTPs 

2.4^1 50pmol/^l VNTR specific antisense primer 
4.5jil 5u/jil Taq DNA polymerase 
10 293^ dH 2 0 

405^ 

For each dog of 20ng/nl genomic DNA was added to 9^1 
of PCR master mix which was overlaid with mineral oil. Each reaction was 
placed onto a preheated thermal cycler at 95°C and incubated for 2 

15 minutes. Thermal cycling then followed with 28 repetitions of denaturation 
at 95°C for 30s, annealing at 65°C for 30s, and extension at 72°C for 30s, 
followed by a final extension of 72°C for 5 minutes. 

On completion of thermal cycling 5pJ of formamide loading 
dye was added to each reaction with denaturation at 90°C for 3 minutes 

20 prior to electrophoresis at 60W on an 8% polyacrylamide denaturing gel. 
The gel was fixed in 10% methanol/10% glacial acetic acid and dried. An 
autoradiography film (BioMax MR; Kodak) was exposed to the gel 
overnight. 

The genotype of each dog was scored with respect to the 
25 VNTR. Ten dogs were selected to represent the 'affected pool 1 of 

individuals and ten were selected to represent the 'wild type pool'. This 
selection was made in order to achieve a scenario that may mimic a 
recessive trait: 
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10 



15 



20 



Affected 

(AC)n 

(AC)n+1 

(AC)n+2 

(AC)n+3 

(AC)n+4 

(AC)n+5 

(AC)n+6 

(AC)n+7 

Wild ty pe 

(AC)n 

(AC)n+1 

(AC)n+2 

(AC)n+3 

(AC)n+4 

(AC)n+5 

(AC)n+6 

(AC)n+7 



Allele frequency 

100% 

0% 

0% 

0% 

0% 

0% 

0% 

0% 

Allele frequency 

15% 

0% 

0% 

0% 

35% 

20% 

0% 

30% 



Amplimers were prepared from genomic DNA of a single dog. 
In a 100^1 volume 5ng of genomic DNA were digested by 20 units Hae III, 
the digestion proceeding to completion over 12 hours at 37°C: 
4.4ul 1.135|i.g/|j.l genomic DNA 
25 10^1 10x restriction buffer 

2\i\ 10u/u.l Hae III 
Mul dH 2 0 
100fj.l 

Digestion was confirmed by electrophoresis of an aliquot of 
30 the reaction on a 1% agarose gel stained with ethidium bromide. 

The DNA was extracted (GFX purification column) and eluted 
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in 50jil 5mM Tris pH8.5, of which approximately 3jxg contained within 30^1 
was incubated with Terminal deoxynucleotidyl transferase for 3 hours at 
37°C : 

30 M I DNA 

5 30>l 5x Terminal deoxynucleotidyl transferase buffer 

4.5pJ 10mMddGTP 

1 0jil 9u/jil Terminal deoxynucleotidyl transferase 

Z5^nidH 2 0 

150ul 

10 The DNA was separated from low molecular weight solutes 

by microconcentration (Microcon-30; Amicon) with successive additions of 
dH 2 0 between episodes of centrifugation. A volume of 35nl was 
recovered. 

An adapter was prepared by annealing two oligonucleotides, 
15 a 24mer (GsCsAsGsGAGACATCGAAGGTATGAAC, where 's' represents 
a phosphorothioate bond) and a 12mer (TTCATACCTTCG): 

7.6nl 197pmol/^ 24mer 

9.2uJ 162pmol/^ 12mer 

±£Znl 10x T4 DNA ligase buffer 
20 18.7^ 

The mixture was heated to 55°C and allowed to cool to 10°C 
over one hour. 

The adapter was ligated to the terminated genomic 

fragments: 
25 35|il DNA 

18.7^1 adapter 

4.3nl 10xT4 DNA ligase buffer 
1.5|il 10u/nlT4 DNA ligase 
2.5nl dH 2 0 
30 62nl 
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The reaction was incubated at 16°C over night, then heat 
inactivated at 70°C for 20 minutes. 

The DNA was separated from low molecular weight solutes 
by microconcentration (Microcon-30; Amicon) with successive additions of 
5 dH 2 0 between episodes of centrifugation. A volume of 54jil was 
recovered. 

To prevent generation of spurious products through priming 
from sites of single strand nicks, these were terminated by incubation with 
Thermo Sequenase: 



10 


54jil 


DNA 




4.4jil 


Thermo Sequenase buffer 




1.4nl 


10mM ddATP 




1.4jil 


10mM ddCTP 




1.4nl 


10mM ddGTP 


15 


1.4jil 


10mM ddTTP 




0.5jil 


32u/|il Thermo Sequenase 




5,5^1 


dH 2 0 




70nl 





The mixture was overlaid with mineral oil and incubated at 
20 74°C for 2 hours. 

The DNA was extracted (GFX purification column) and eluted 
in 5Q\i\ 5mM Tris pH 8.5. 

Amplimers were prepared from this DNA using VNTR primers 
and the 24mer oligonucleotide contained within the adapter as the adapter 
25 primer: 

5|il 1 0x T^q DNA polymerase buffer 

5nl lOxd^TPs 

2\i\ 25pmbl/uJ adapter primer 

2\il 25pmpl/|d VNTR primer [ (AC)1 1 B, (CA)11D, (GT)11H, 
30 or(TG)11V] ! 
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2^1 terminated, adapter-ligated DNA fragments (approx. 

SOng/^l) 

34^1 dH 2 0 
50^l 

5 Similar reactions were prepared containing a VNTR primer 

but in the absence of genomic DNA. In addition, a single reaction was 
performed containing genomic DNA but in the absence of a VNTR primer. 
All reactions were overlaid with mineral oil and incubated at 95°C for 2 
minutes. Addition of 0.5|il of 5u/jxl Taq DNA polymerase was made to each 

10 reaction. Amplification was achieved by thermal cycling for 18 repetitions 
of 95°C for 30 s, 65°C for 45s, 72°C for 45s ( followed by a final extension of 
72°C for 5 minutes. 

On completion of amplification 5ii\ of each reaction were 
subjected to electrophoresis with a molecular weight marker on a 1 .5% 

15 agarose gel stained with ethidium bromide. The presence of amplified 
products in the lanes representing reactions containing template DNA and 
a VNTR primer confirmed that ligation of the genomic fragments to adapter 
sequence had occurred. In each case the appearance of these lanes was 
similar, there being a smear of amplified products distributed over a range 

20 of molecular weights from approximately 100bp to 500bp. All other lanes 
lacked product of amplification. The fact that the reaction containing 
template DNA but no VNTR primer did not generate product confirmed that 
the all 3* ends had been terminated successfully such that chain extension 
in the presence of Taq DNA polymerase was prevented. 

25 The (AC)1 1 B and (CA)1 1 D primed reactions were combined. 

Also, the (GT)1 1 H and (TG)1 1 V primed reactions were combined. Both 
amplimer pools were separated from low molecular weight solutes by 
microconcentration (Microcon-30; Amicon) with addition of dH 2 0 between 
episodes of centrifugation| Quantification by agarose gel electrophoresis of 

so the recovered DNA suggested that each contained approximately 35ng/jil 
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amplimer DNA. 

The repeat sequences were removed from the pooled 
(AC)11B and (CA)11D primed products using T4 DNA polymerase and 
Exonuclease VII: 

5 14^1 35ng/^il (AC)1 1B/(CA)1 1 D primed amplimer DNA 

2\x\ 1 0x T4 DNA polymerase buffer 
1H 10mMdATP 
1^1 10mMdCTP 

2^1 1 in 4 dilution of 4u/|d T4 DNA polymerase 

to 20j.il 

The reaction was incubated at 12°C for 1 hour then 
inactivated at 70°C for 20 minutes. 

To the reaction was added of 1 0u/jal Exonuclease VII with 
incubation at 37°C for 30 minutes followed by 70°C for 20 minutes. 

15 The designated affected and wild type DNA pools were 

prepared by combining equal amounts of genomic DNA, quantified by 
spectrophotometry, of the selected dogs. These were phenol/chloroform 
extracted and microconcentrated (Microcon; Amicon) with addition of dH 2 0 
between episodes of centrifugation. 

20 Each pool of genomic DNA was digested by Hae III, 

terminated using Terminal deoxynucleotidyl transferase, and ligated to the 
adapter in a manner similar to that previously described. Complete 
termination of all 3* ends was confirmed by PCR with the adapter primer. 
The DNA pools were quantified by agarose gel electrophoresis and were 

25 found to contain approximately equal concentrations. 

In a minimal volume 2.5^1 of the 35ng/jil (AC)/(CA) primed 
amplimer pool, digested with T4 DNA polymerase and Exonuclease VII, 
were hybridised in 0.6M NaCI to approximately 300ng of the 'affected 1 
genomic DNA pool that had been fragmented, terminated, and ligated to 

30 the adapter. This was achieved by denaturing the mixture under mineral oil 
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at 98°C for 3 minutes, followed by a stepwise reduction in the temperature 
from 80°C to 70°C over ten hours and sustaining the final temperature for a 
further 10 hours. The wild type pool was hybridised in a similar manner in 
parallel. 

5 To each hybridisation were added: 

20nl 1 0x Taq DNA polymerase buffer 
20nl 10xdNTPs 
160^ 1 dH z O 
200jil 

io In each case the total volume containing the hybridised DNA 

was divided between two reaction tubes. Under mineral oil each volume 
was heated to 75°C. 1jU of 5u^l Taq DNA polymerase was added to each 
tube followed by incubation at 72°C for 10 minutes. 
The reactions were denatured at 95°C for 3 minutes and 4ul of 25pmol/ul 

15 adapter primer were added. Amplification of the hybridised DNA was 

achieved by thermal cycling for 30 repetitions of 95°C for 30s, 65°C for 30s, 
72°C for 90s, followed by a final extension of 72°C for 5 minutes. 

The reactions containing affected DNA were pooled, as were 
the reactions containing wild type DNA, and 8p.l of lOu/jil Exonuclease I 

20 were added to each 200^.1 volume of amplified DNA. The reactions were 
incubated at 37°C for 15 minutes. 

For each reaction the DNA was separated from low molecular 
weight solutes (Microcon-30; Amicon) with addition of dH 2 0 between 
episodes of centrifugation. In each case a volume of 10^1 was recovered. 

25 The alleles contained within each sample were denatured and allowed to 
anneal by incubation under mineral oil at 98°C for 5 minutes followed by a 
rapid reduction in temperature to 75°C. At 75°C 2M NaCI and 10mM CTAB 
were added to give final concentrations of 50mM and 500uM, respectively. 
The hybridisation reactions were incubated at 75°C for a further 16 hours. 

30 To each hybridisation reaction was add d 150^1 of 5mM Tris 



WO 98/42867 



PCT/GB98/00840 



-78- 

pH 8.5. The diluted hybridisation reactions were then separated from low 
molecular weight solutes (Microcon-30; Amicon) with addition of dH 2 0 
between episodes of centrifugation. These were judged to contain 
approximately 1 0pmoles DNA. Digestion by T4 endonuclease VII at a 

5 concentration of 50u/nl in Taq DNA polymerase buffer was performed in a 
volume of 100ul. The digestion proceeded at 37°C for 30 minutes prior to 
incubation at 65°C for 15 minutes. 

Each digest was separated from low molecular weight solutes 
(Microcon-30; Amicon) with addition of dH 2 0 between episodes of 

10 centrifugation. The recovered volume in each case was divided between 
three tubes, each being digested either by 0.5u/nl Exonuclease I in 1x Taq 
DNA polymerase buffer, 1u/p.l T7 gene 6 exonuclease followed after heat 
inactivation at70°C for 10 minutes by 0.5u/\i\ Exonuclease I in 1x T7 gene 
6 exonuclease buffer, or 1u/ul T7 gene 6 exonuclease together with 1u/nl 

is S1 nuclease in 1x T7 gene 6 exonuclease buffer. The concentration of 
DNA in each reaction was approximately 0.1pmol4il contained within a 30ul 
volume. The Exonuclease I reactions were performed at 37°C for 15 
minutes prior to heat inactivation at 70°C for 10 minutes. The reactions 
containing T7 gene 6 exonuclease with or without S1 nuclease were 

20 performed at 37°C for 10 minutes. On completion of each regime of 
digestion the DNA was extracted (GFX purification column) and eluted in 
50>il dH 2 0. 

Three quarters of each of the extracted DNA samples was 
amplified by PCR with Taq DNA polymerase 
25 37.5jxl digested DNA 

i .... 

15|j.l 1 0x Taq DNA polymerase buffer 
15^1 10xdNTPs 

6|jlI 25pmpl/|al adapter primer 
ZaSnldHzOl 
30 1 50^1 j 



WO 98/42867 



PCT/GB98/00840 



-79- 

The reactions were divided into 75^il aliquots and overlaid 
with mineral oil to which were added 0.75^1 of 5ul\i\ Taq DNA polymerase 
after incubation at 95°C for 2 minutes. Amplification was achieved by 
thermal cycling for 25 repetitions of 95°C for 30s, 65°C for 30s, 72°C for 
5 90s, followed by a final extension of 72°C for 5 minutes. 

To each 150jal of amplified DNA were added 6\i\ 10u/jxl 
Exonuclease I. The reactions were incubated at 37°C for 15 minutes. 

The DNA in each case was separated from low molecular 
weight solutes (Microcon-30; Amicon) with addition of dH 2 0 between 
10 episodes of centrifugation. Repetition of hybridisation in 50mM NaCI and 
500^iM CTAB followed by each regime of digestion was repeated, followed 
by amplification of the resulting DNA by PCR with Taq DNA polymerase, as 
above. 

Aliquots of each of the amplified samples were subjected to 

15 electrophoresis on a 1.5% agarose gel stained with ethidium bromide with 
a molecular weight marker. The amplified products in the lanes 
corresponding to DNA digested by T4 endonuclease VII followed by 
Exonuclease I were of high molecular weight smearing towards the well. In 
contrast, the lanes corresponding to amplified product that had been 

20 digested by either T7 gene 6 exonuclease followed by Exonuclease I or T7 
gene 6 exonuclease concomitantly with S1 nuclease contained products 
ranging in molecular weights from approximately 200bp to 750bp. The 
distribution of molecular weights in each case was similar. No smearing 
towards the well was seen suggesting that the spurious products of 

25 amplification that were seen in the absence of T7 gene 6 exonuclease were 
eliminated by the presence of this enzyme. As such, T7 gene 6 
exonuclease was considered an essential component of the mis-match 
discrimination regime for removal of repeat sequ nces from T4 
endonuclease VII cleaved molecules that would otherwise cross-hybridise 

30 and produce spurious DNA molecules. . 

To each of the 150^1 volumes of amplified DNA resulting from 
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the second round of mis-match discrimination were add G\i\ of 10u/^l 
Exonuclease I and the reactions were digested at 37°C for 15 minutes. 

The DNA in each case was separated from low molecular 
weight solutes (Microcon-30; Amicon) with addition of dH 2 0 between 

5 episodes of centrifugation. 

For each of the reactions corresponding to the 'affected' dogs 
amplification was performed by PCR with Taq DNA polymerase using the 
VNTR specific primers in a volume of 50jal containing approximately 25ng 
DNA. Amplification by 28 repetitions of thermal cycling was performed 

10 after which aliquots and a molecular weight marker were loaded onto a 
2% agarose gel stained with ethidium bromide. 

For the lanes corresponding to digestion by T4 endonuclease 
VII and Exonuclease I the product of the expected molecular weight was 
very faint. In addition a large amount of spurious product in the vicinity of 

15 the wells was seen. For all other lanes no high molecular weight products 
were seen. Furthermore, the amplified products were seen clearly as a 
discrete band of the expected molecular weights of approximately 130bp. 

The products of amplification corresponding to digestion by 
T4 endonuclease VII and Exonuclease I were discarded. The remaining 

20 reactions were amplified further using the VNTR specific primers, one of 
which was labelled with [y-33P] ATP using T4 polynucleotide kinase. 
Amplification reactions were performed by PCR using Taq DNA 
polymerase in volumes of 20p.l containing 10pmoles of each primer for 35 
repetition of thermal cycling. In addition, reactions were performed in the 

25 same manner containing 40ng of the pooled 'affected 1 and pooled 'wild 
type 1 DNA. After addition of 10^1 of formamide loading dye to each sample 
the amplified products were denatured at 90°C for 3 minutes. 6^1 aliquots 
of the mixture were subjected to electrophoresis on an 8% polyacrylamide 
denaturing gel. The gel was fixed and dried and exposed to an 

30 autoradiography film. 

It was found that product was visible for DNA amplified from 
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affected DNA following the second round of mis-match discrimination. This 
was seen in both the lanes corresponding to digestion by T7 gene 6 
exonuclease followed by Exonuclease I and those corresponding to 
digestion by T7 gene 6 exonuclease concomitantly with S1 nuclease. In 

5 each case the product resembled that resulting from amplification of the 
pooled affected DNA that had not been subjected to mis-match cleavage. 
In the case of wild type DNA amplified after the second round of mis-match 
discrimination no products were discernible. 

This experiment confirmed that VNTRs are reproduced with 

10 fidelity from the pooled genomes of several individuals, the alleles in each 
case being preserved, and mis-match discrimination serves to eliminate 
spurious products of amplification and enrich the VNTR allele of the highest 
frequency. Although no products were visible for DNA derived from the 
wild type DNA, it may be that products would become visible with higher 

is loading of DNA on the polyacrylamide gel. As such, further repetition of the 
mis-match discrimination procedure would be necessary to reduce to near 
homozygosity the alleles in both DNA pools such that final selection of the 
informative allele could be achieved. 

20 Example 5 

Demonstration of the resistance to Exonuclease III of DNA with a 3 9 
overhang derived by ligation to an adapter. 

A cloned VNTR allele was amplified by Taq DNA polymerase. 
The amplified DNA was separated from low molecular weight solutes by 
25 microconcentration (Microcon-30; Amicon) with successive additions of 
dH 2 0 between episodes of centrifugation. 

The volume recovered was measured at 44^1, the 
concentration of which was determined by agarose gel electrophoresis to 
be 160ng/nl, approximating to 1.6pmol/}il. 
30 The amplified DNA was blunted by T4 DNA polymerase 

digestion; 
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42^1 DNA 
3.25^1 10mM dATP 
3.25^1 10mM dCTP 
3.25^1 10mM dGTP 
5 3.25^1 10mMdTTP 

1 3\x\ 1 0x T4 DNA polymerase buffer 
3.25|j.l 4u/^l T4 DNA polymerase 
5Q^i dH 2 0 
130fil 

io The reaction was incubated at 12°C for 30 minutes, then heat 

inactivated at 70°C for 20 minutes. The DNA was separated from low 
molecular weight solutes by microconcentration (Mtcrocon-30; Amicon) with 
successive additions of dH 2 0 between episodes of centrifugation. A 
volume of 30jil was recovered. 
15 1600pmoles of a 21mer oligonucleotide 

(CTCGCAAGGATGGGATGCTCG) were phosphorylated with T4 
polynucleotide kinase diluted to 10u/|al in the supplied dilution buffer: 
3.19nl 21mer oligonucleotide 
1.5jil 10xT4 DNA ligase buffer 
20 1 \x\ 1 Ou/jil T4 polynucleotide kinase 

flL3|4 dH 2 0 
15^x1 

The reaction was incubated at 37°C for 30 minutes, then heat 
inactivated at 90°C for 10 minutes. 
25 To the kinase reaction was added 1600pmoles of a 12mer oligonucleotide 
(CATCCTTGCGAG). Annealing of the oligos to form an adapter was 
achieved by heating to 55°C and allowing the mixture to cool to 10°C over 
a period of 1 hour. j 

Half of thepNA blunted by T4 DNA polymerase was saved. 
30 To the annealed adapter was added the remaining 15^1 of blunted DNA 
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such that the adapter was in a 50 fold excess: 
15jil blunted DNA 
16.2|il annealed adapter 
1.9^1 10xT4 DNA ligase buffer 
5 l^i 10u/^T4 DNA ligase 

34^1 

The ligation reaction was incubated over night at 16°C. 

The ligation was heat inactivated at 70°C for 20 minutes and 
the DNA was separated from low molecular weight solutes by 
10 microconcentration (Microcon-30; Amicon) with successive additions of 
dhbO between episodes of centrifugation. 

The volume recovered was measured to be 36|il. 

The ligated DNA and 15jal of non-ligated DNA that had been 
saved were both made to approximately 0.75pmoles/|al by addition of 
is dH 2 0. Each was digested by Exonuclease III at a final concentration of 
DNA approximating to 0.2pmol/jil: 

10.7nl DNA 

4\x\ 10x Exonuclease III buffer 
200u/fxl Exonuclease III 
20 24^iidH 2 0 

The reaction was incubated 37°C for 5 minutes then heat 
inactivated at 70°C for 20 minutes. 

Approximately 2pmoles of each digest were loaded onto a 2% 
25 agarose gel stained with ethidium bromide. All non-ligated DNA was 
digested to completion by Exonuclease 111 such that none was detectable 
on the agarose gel. In contrast, although some digestion had occurred, 
much of the ligated DNA was found to be resistant to digestion. That which 
had been digested was assumed to have failed to ligate to the 
30 phosphorylated adapter. This experiment confirmed that ligation of an 
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adapter is one method by which DNA molecules may become resistant to 
Exonuciease III digestion, those molecules lacking an adapter being 
digested to completion by this enzyme. 

5 Selection of unique sequences in a pool of DNA hybridised to a 
second pool of DNA using Exonuciease III. 

Two cloned VNTR alleles that differed in their repeat lengths 
by four nucleotides were amplified by PCR using Taq DNA polymerase. 
The amplified DNAs were separated from low molecular weight solutes by 
10 microconcentration (Microcon-30; Amicon) with successive additions of 
dH 2 0 between episodes of centrifugation and the resulting concentrations 
of DNA were determined by agarose gel electrophoresis. 

To a portion of the amplified products of the smaller allele 
was added a 3' overhang by incubation with Terminal deoxynucieotidyl 
15 transferase: 

12.5^1 120ng/jil DNA(approx. 1.2pmol/^l) 
1 5^1 5x Terminal deoxynucieotidyl transferase buffer 
1.125jxl 10mMdATP 
3.3^1 9u/^il Terminal deoxynucieotidyl transferase 
20 43^1 dH 2 0 

75^1 

The reaction was incubated at 37°C for 1 hour after which the 
DNA was extracted (GFX purification column). 

To 450ng of the allele possessing a 3' overhang was added: 
25 (i) 4.5^g of the same allele that lacked a 3' overhang; 

(ii) 4.5|ig of thejlarger allele that lacked a 3' overhang. 

In each case, the total volume was minimised by 
microconcentration (Micrdcon-30; Amicon). These mixtures were 
denatured at 98°C for 3 minutes and annealed at 75°C for 2 hours in the 
30 presence of 0.2M NaCI arjd 100^iM CTAB. 
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To each hybridisation reaction were added: 

1 0\x\ 1 0x Taq DNA polymerase buffer 

1 0jil 500u/|a! T4 endonuclease VII 

BQ^i dH 2 0 
5 100^1 

The reactions were incubate at 37°C for 45 minutes, then 
inactivated at 70°C for 15 minutes. 

The DNAs were separated from low molecular weight solutes 
by microconcentration (Microcon-30; Amicon) with successive additions of 
10 dhbO between episodes of centrifugation. In each case a volume of 

approximately 40^1 was recovered which was diluted in a reaction mixture 
containing 5u/pJ Exonuclease III: 

40|il DNA 

1 5jil 10x Exonuclease III buffer 
15 3.75nl 200u/nl Exonuclease III 

Mni dH 2 0 
150^1 

The reactions were incubated at 37°C for 5 minutes, after 
which they were microconcentrated (Microcon-30; Amicon). The entire 

20 recovered volumes were subjected to electrophoresis on a 1 .5% agarose 
gel stained with ethidium bromide. In addition, a molecular weight marker, 
400ng of the small allele without a 3' overhang, and 400ng of the smaller 
allele that possessed an overhang were loaded on to the geL 

The size of the smaller amplified allele was confirmed to be 

25 approximately 150bp by comparison to the molecular weight marker. After 
incubation with Terminal deoxynucleotidyl transferase the apparent size of 
this amplified allele had increased. A smear of products distributed over a 
range of sizes corresponding to between 400bp and 750bp of double 
stranded DNA was seen, though the majority of DNA was confined to an ill- 

30 defined band midwayjbetween these. In the lane containing hybridised 
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alleles of different sizes that had been digested, a band corresponding to 
approximately 300bp of double stranded DNA was seen against a back 
ground smear of products. This band was considered to be the result of 
enzymatic cleavage of the mis-match containing DNA duplexes, where as 

5 the back ground smear was considered to be single stranded DNA 

resulting from Exonuclease III digestion of molecules lacking the protection 
of a 3' overhang. In the lane that contained hybridised alleles of the same 
size two ill-defined bands were visible against a background smear of 
products. The brightest band was of an appearance similar to that of the 

io smaller allele following its incubation with Terminal deoxy nucleotidyl 

transferase and was considered to represent the remaining single stranded 
DNA from heteroduplex molecules digested by Exonuclease III. The fainter 
band was considered to the result of enzymatic cleavage of molecules 
possessing polymerase errors. As before, the background smear was 

is considered to be due to single stranded DNA of molecules lacking a 3' 
overhang that had resulted from digestion by Exonuclease III. This 
experiment suggests that an allele possessing a 3' overhang entering into a 
heteroduplex with an allele of a different repeat length is digested by T4 
endonuclease VII and Exonuclease III such that a fragment of the 

20 heteroduplex may be selected. 
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Appendix 



Consider a scenario that may typify a rare recessive trait. The affected 
group of individuals are homozygous for the same allele. In the wild type 
group, this allele has a relatively low frequency. 





Starting scenario 




Affected 






Wild Type 






Alleles 


A 


B C 


D 


A 


B C 


D 


10 


Allele frequencies 


1.0 


0.0 0.0 


0.0 


0.15 


0.35 0.2 


0.3 




Allele ratios 


1 


0 0 


0 


3 


7 4 


6 




After 1 st Round 




Affected 






Wild Type 






Alleles 


A 


B C 


D 


A 


B C 


D 


15 


Amount remaining 


1.000 


0.000 0.000 


0.000 


0.023 


.0.123 0.040 


0.090 




Total remaining 


1.0 






0.276 








Allele ratios 


1 


0 0 


0 


23 


123 40 


90 




Allele frequencies 


1.000 


0.000 0.000 


0.000 


0.083 


0.446 0.145 


0.326 


20 


After 2 nd Round 




Affected 






Wild Type 






Alleles 


A 


B C 


D 


A 


B C 


D 




Amount remaining 


1.0 


0.0 0.0 


0.0 


0.006 


0.199 0.021 


0.106 




Total remaining 


1.0 






0.332 








Allele ratios 


1 


0 0 


0 


6 


199 21 


106 


25 


Allele frequencies 


1.0 


0.0 0.0 


0.0 


0.018 


0.599 0.063 


0.319 




After 3 rd Round 




Affected 






Wild Type 






Alleles 


A 


B C 


D 


A 


B C 


D 




Amount remaining 


1.0 


0.0 0.0 


0.0 


0.000 


0.359 0.004 


0.102 


30 


Total remaining 


10 






0.465 








Allele ratios 


1 


0 0 


0 


0 


359 4 


102 




Allele frequencies 


1.0 


0.0 0.0 


0.0 


0.000 


0.772 0.008 


0.219 
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After 4 th Round 




Affected 






Wild Type 




Alleles 


A 


B 


c 


D 


A 


B C 


D 


Amount remaining 


1.0 


0.0 


0.0 


0.0 


0.000 


0.596 0.000 


0.010 


Total remaining 


1.0 








0.606 






Allele ratios 


1 


0 


0 


0 


0 


596 0 


10 


Allele frequencies 


1.0 


0.0 


0.0 


0.0 


0.000 


0.983 0.000 


0.017 



Comparison of the 1 x1 x1 x1=1 0.276 x 0.332 x 0.465 x 0.606 = 0.026 
ratios of remaining 
10 alleles 38.5:1 

all of which is A none of which is A 

Therefore, even If an large excess of wild type DNA is 
hybridised to the affected DNA that survives the mis-match discrimination 
15 procedure it is extremely likely that the allele present in the affected group 
will be recovered. 

Consider another scenario in which one allele is present in 
the affected group of individuals at a frequency greater than that of the wild 
type group. 

20 

Starting scenario 
Alleles 

Allele frequencies 
Allele ratios 

25 

After 1 st Round 
Alleles 

Amount remaining 
Total remaining 
30 Allele ratios 

Allele frequencies 



Affected 
ABODE 

0.050 0.100 0.000 0.150 0J00 
1 2 0 3 14 

Affected 
A B C D E 

0.003 0.010 0.000 0.023 0.490 
0.526 

3 10 0 23 490 
0.006 0.019 0.000 0.044 0.932 



Wild Type 
ABODE 

0.250 0.200 0.150 0.250 0.150 
5 4 3 5 3 

Wild Type 
A B C D E 

0.063 0.040 0.023 0.063 0.023 
0.212 

63 40 23 63 23 
0.297 0.189 0.108 0.297 0.108 
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After 2 nd Round Affected Wild Type 

Alleles ABODE ABODE 

Amount remaining 0.000 0.000 0.000 0.002 0.869 0.088 0.036 0.012 0.088 0.012 

Total remaining 0.871 0.236 

5 Allele ratios 0 0 0 2 869 22 9 3 22 3 

Allele frequencies 0.000 0.000 0.000 0.002 0.998 0.373 0.153 0.051 0.373 0.051 

After 3 rd Round Affected Wild Type 

Alleles ABODE ABODE 

10 Amount remaining 0.000 0.000 0.000 0.000 0.996 0.139 0.023 0.003 0.139 0.003 

Total remaining 0.996 0.307 

Allele ratios 0 0 0 0 1 139 23 3 139 3 

Allele frequencies 0.000 0.000 0.000 0.000 1.000 0.453 0.075 0.010 0.453 0.010 

1 5 After 4 th Round Affected Wild Type 

Alleles ABODE ABODE 

Amount remaining 0.000 0.000 0.000 0.000 1.000 0.205 0.006 0.000 0.205 0.000 

Total remaining 1.0 0.416 

Allele ratios 0 0 0 0 1 205 6 0 205 0 

20 Allele frequencies 0.000 0.000 0.000 0.000 1.000 0.493 0.014 0.000 0.493 0.000 

Comparison of the 0.526 x 0.871 x 0.996 x 1 = 0.4560.212 x 0.236 x 0.307 x 0.416 = 0.006 
ratios of remaining 

alleles 76 : 1 

25 M of which is E none of which is E 

Therefore, even if an large excess of wild type DNA is 
hybridised to the affected DNA that survives the mis-match discrimination 
procedure it is extremely likely that allele E present in the affected group 
30 will be recovered. 
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CLAIMS 

5 1 . A method of making a mixture of VNTR alleles and their 

flanking regions of the genomic DNA of one or more members of a species 
of interest, which method comprises the steps of: 
a ) dividing genomic DNA of the species of interest into 

fragments, 

io b) ligating to each end of each fragment an adaptor thereby 

forming a mixture of adaptor-terminated fragments in which each 3'-end is 
blocked to prevent enzymatic chain extension, 

c) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR primer to 

15 create a mixture of 5-flanking VNTR amplimers, 

d) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR antisense 
primer to create a mixture of 3-flanking VNTR amplimers, 

e) and using genomic DNA of the one or more members of the 
20 species of interest as template with the mixture of 5-flanking VNTR 

amplimers and/or the mixture of 3-flanking VNTR amplimers as primers to 
make the desired mixture of VNTR alleles and their flanking regions. 

2. The method of claim 1 , wherein step b) is performed by 
terminating each 3'-end of each fragment to prevent enzymatic chain 

25 extension, and ligating each 5'-end of each fragment to an adaptor, thereby 
forming a mixture of adaptor terminated fragments. 

3. The method of claim 1 or claim 2, wherein in step c) the 
VNTR repeat sequences are removed from the 5'- flanking VNTR 
amplimers, and in step d) the VNTR repeat sequences are removed from 

so the 3'- flanking VNTR amplimers. 
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4. The method of any one of claims 1 to 3, wherein in step c) 
and/or d) the adaptor or primer used contains at least one 
phosphorothioate bond. 

5. The method of any one of claims 1 to 4, wherein step e) is 
5 performed using as primers, either successively or together, both the 

mixture of 5*- flanking VNTR amplimers and the mixture of 3 - flanking 
VNTR amplimers. 

6. The method of any one of claims 1 to 5, wherein there is used 
in step e) genomic DNA of one or more members of the species of interest 

10 which manifest a trait of interest, whereby the resulting mixture of VNTR 
alleles and their flanking sequences is representative of those which 
manifest the trait of interest. 

7. The method of claim 6 wherein in a step f) the strands of the 
mixture of VNTR alleles and their flanking regions are separated and then 

is re-annealed and any mis-matches are separated and discarded. 

8. The method of claim 7, wherein step f) is repeated to recover 
a single VNTR allele and its flanking regions. 

9. The method of any one of claims 6 to 8, wherein at least one 
VNTR allele and its flanking sequences representative of those which 

20 manifest the trait of interest, is hybridised with a mixture of VNTR alleles 
and their flanking sequences representative of those which do not manifest 
the trait of interest, and at least one match and/or at least one mis-match is 
selected to provide at least one VNTR allele or fragment thereof which is 
characteristic of the trait of interest. 

25 10. The method of claim 9, wherein the at least one VNTR allele 

and its flanking sequences representative of those which manifest the trait 
of interest, is provided with 3 - overlapping ends. 
11. A portion of genomic DNA of one or more members of a 

species of interest, said portion consisting essentially of a representative 

30 mixture of alleles of a chosen VNTR sequence and their flanking regions. 
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12. The portion as claimed in claim 1 1 , wherein the mixture of 
alleles is representative of those which manifest a trait of interest 

13. The portion as claimed in claim 1 1 or claim 12, wherein each 
member of the mixture has an adaptor at each of its 3-end and its 5-end. 

5 1 4. A portion of genomic DNA of one or more members of a 

species of interest, said portion consisting essentially of a single VNTR 
allele and its flanking regions and an adaptor at each of its 3'-end and its 
5-end, said allele being characteristic of those which manifest a trait of 
interest. 

io 1 5. A portion of genomic DNA of a species of interest, said 

portion consisting essentially of a representative mixture of 3-flanking 
regions of a chosen VNTR sequence, each member of the mixture carrying 
an adaptor at its 3'-end. 

16. A portion of genomic DNA of a species of interest, said 
15 portion consisting essentially of a representative mixture of 5'-flanking 

regions of a chosen VNTR sequence, each member of the mixture carrying 
an adaptor at its 5'-end. 

17. A method of treating a mixture of polymorphic alleles, the 
mixture being representative of those which manifest a trait of interest, 

20 which method comprises separating and then re-annealing strands of the 
mixture, and separating and discarding any mis-matches. 

1 8. The method of claim 17, wherein the mixture of polymorphic 
alleles is a mixture of alleles of a chosen VNTR sequence and their flanking 
regions. 

25 1 9. The method of claim 1 8, wherein the method is repeated to 

recover a single VNTR allele and its flanking regions. 
20. The method of any one of claim 17 to 19, wherein at least one 

VNTR allele and its flanking sequence representative of those which 
manifest the trait of interest, is hybridised with a mixture of VNTR alleles 

30 and their flanking sequences representative of those which do not manifest 
the trait of interest, and at least one match and/or at least one mis-match is 

i : 
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selected to provide at least one VNTR allele or fragment thereof which is 
characteristic of the trait of interest. 

21 . The method of claim 20, wherein the at least one VNTR allele 
and its flanking sequence representative of those which manifest the trait of 

5 interest, is provided with ^-overlapping ends. 

22. A method of making a mixture of amplimers which method 
comprises the steps of: 

a) dividing genomic DNA of one or more members of a species 

of interest into fragments, 
io b) Hgating to each end of each fragment an adaptor thereby 

forming a mixture of adaptor-terminated fragments in which each 3-end is 
blocked to prevent enzymatic chain extension, and 

c) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR primer to 

15 create a mixture of 5-flanking VNTR amplimers, and/or 

d) using a portion of the mixture of adaptor-terminated 
fragments as templates with an adaptor primer and a VNTR antisense 
primer to create a mixture of 3-flanking VNTR amplimers. 

23. A method of identifying an allele which is linked to a trait of 
20 interest, which method comprises incubating together under hybridisation 

conditions: at least one polymorphic allele and its flanking sequences 
representative of those which manifest the trait of interest; and a mixture of 
polymorphic alleles and their flanking sequences representative of those 
which do not manifest the trait of interest; and selecting at least one match 
25 and/or at least one mis-match to provide at least one allele or fragment 
thereof which is linked to the trait of interest. 

24. The method of claim 23, wherein the alleles are VNTR alleles. 

25. The method of claim 23 or claim 24, wherein the at least one 
allele and its flanking sequences representative of those which manifest the 

30 trait of interest, is provided with 3 - overlapping ends. 
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26. Use of the portion of genomic DNA as claimed in claim 14 in 
a diagnostic assay. 

27. The method of any one of claims 1 to 10 or 17 to 21 , wherein 
the VNTR allele and its flanking regions, or the mixture of VNTR alleles and 

5 their flanking regions, is analysed by being applied under hybridisation 
conditions to an array of immobilised VNTR alleles and/or their flanking 
regions. 

28. A kit comprising protocols and reagents for performing the 
method of any one of claims 1 to 10 r 17 to 25 or 27. 



